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ABSTRACT 

This report is part of the National Assessment on 
what children know and can do in science. The results are reported in 
terms of percentages of respondents who provide correct responses to 
specific exercises. Among the interesting results are the following: 
(1) Males performed better than females, increasingly through the 
four National Assessment ages; (2) Females did appreciably better 
than rrales on exercises involving knowledge of human birth or 
reproduction; (3) At all four ages, respondents from the Southeast 
were successful less often than the respondents from the country as a 
whole; (4) The other three regions were similar to one another in 
typical performance, with some regional variations; and (5) 
Individuals living in urban fringe areas scored the highest and those 
in large cities the lowest. Appendices include: Definitions of Age, 
Region and Size of Community; National Percentages of Success; 

Effects for Region, Size of Community, and Sex for the Science 
Exercises; and Standard Errors for Region, Size-of— Community, and Sex 
Effects. (BB) 
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SUMMARY 



National Assessment's main purpose is to measure change in 
what children and young adults know and can do. 

The first step toward providing this information for science 
was National Assessment Report No. 1/ * which gave the national 
percentages of success for about half the science exercises 
administered during the first science assessment, conducted 
in 1969-70. 

The next step toward answering questions about progress can- 
not be taken until science is assessed again in 1972-73. 

After that, reports of science results will emphasize what 
seems to have been learned better, and what less well. In 
the meantime, our reports are confined to the results of the 
only administration of science exercises conducted so far. 

This report examines differences between sexes, among four 
geographical regions, and among four sizes of community.** 

The results are reported in terms of percentages of respon- 
dents who provide correct responses to specific exercises. 

The most interesting differences in percent correct are: 

(Sex) 

A. Males performed better than females, increasingly through 
the four National Assessment ages (9, 13, 17 and young 
adults between 26 and 35) . Almost all of this increase 
came from exercises calling on knowledge of physical 
science. For exercises on biological science, adult 
males did 2% better than adult females, while the cor- 
responding difference for exercises on physical science 
was more than 14%. (At age 17, the differences were 
about 1.5% and 4% respectively.) 

B. Females did appreciably better than males on exercises 
involving knowledge of human birth or reproduction. 

* National Assessment Report 1. 1969-1970 Science; National 

Results and Illustrations of Group Comparisons . Superintendent 
of Documents, U.S. Government Printing Office, Washington, 

D.C. 20402, $1.75. 

** These groups are defined in Appendix B. 



(Region) 



C. At all four ages, respondents from the Southeast were 
successful about 5% less often than respondents from the 
country as a whole. 

D. The other three regions were similar to one another in 
typical performance, with some advantage for the North- 
eastern and Central regions at ages 9 and 13, for the 
Northeast and West at age 17, and for the West for young 
adults . 

(Size of Community) 

E. Respondents living in Urban Fringes did about 3% better 
than the country as a whole, Medium-Size Cities came 
next, then Smaller Places, and then Big Cities (roughly 
3.5% below the national performance) . 

F. The relative position of Smaller Places and Big Cities 
changed somewhat with age. At ages 9 and 13, Smaller 
Places fell only 1% below the country as a whole, while 
Big Cities fell nearly 5% below. At ages 17 and 26 to 
35, Smaller Places and Big Cities did about the same, 
roughly 2.5% below the country as a whole. 

(Objectives) 

G. Whether comparisons were based on sex, region, or size 
of community, the typical differences for exercises 
assigned to Objective 2 (Possess the abilities and skills 
needed to engage in the processes of science) tended to 
be a little more extreme (about 1%) than those associated 
with Objective 1 (Know the fundamental facts and prin- 
ciples of science) . 



LIMITATIONS 



Interpretation 

This report considers three of the ways in which the results 
of the first Science report can be divided: sex, region and 

size of community. A later report will also consider certain 
other ways, including parental education, respondent's color 
and type of community. 

In the present report, sex, region, and size of community 
are analyzed separately. Consider a region, say the South- 
east, and an age, say 9. The figures given here are the best 
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figures we know how to extract to show how 9-year-olds from 
the Southeast perform in comparison with all 9-year-old groups. 

For those concerned with action to improve or maintain edu- 
cational progress, these same figures illuminate simple 
facts, for example how 9-year-old performance in the South- 
east compares with that for the whole country. 

There is a kind of interpretation that should never be made 
on the basis of the sort of figures given in this report. 

The fact that figures reflect Southeast performance or Big 
City performance does not mean that the performances thus 
reflected have arisen precisely from living in the South- 
east or in a Big City, or from the attitudes, techniques, 
facilities and staffs of the school system involved. 

in particular, just what happens in a region involves other 
things than that region's schools. Larger fractions of the 
children in some regions belong to a particular size-of- 
community group. Thus effects due only to size of community 
can appear to be regional differences. Larger or smaller 
fractions of the parents in some regions have particular 
amounts of education. Thus effects due only to parental ed- 
ucation can appear to be regional differences. And so on. 
Migration from one region or size of community to another 
can further complicate the picture. There are such diffi- 
culties, some of which we know how to adjust for, and some 
of which we do not. 

It is important for us to distinguish between an interest 
in causes and an interest in what the present situation is. 

To guide readers who want to think about causes, numbers 
usually have to be found by looking at (or considering) com- 
binations of several classifications. Such numbers will be 
more appropriate for thinking about causes than the sort of 
number given in the present report, although they may still 
be far from perfect. (Such numbers will be given in a later 
report.) To guide readers who want to compare today's sit- 
uations, as they stand, say region versus region or one size 
of community versus another, it is appropriate to look at 
regions separately or at sizes of communities separately, 
which has been done in this present report. This distinction 
between causes and present situations is important. Readers 
should be careful to understand it and then keep it in mind. 
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Coverage 



The analyses of this report cover all exercises for which 
adequate scorings were available for machine analysis by 
January 22, 1971. Certain other exercises (up to five per 
age) will have adequate scorings available for in- 
clusion in later reports. There is no reason to expect that 
this absence of at most 4% of the exercises appreciably 
alters the overall impact of the present report. 
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METHODS 



This report presents results for respondents at four ages 
(9, 13, 17, and young adults) classified in three different 

ways: males and females, residents of four different geo- 

graphical regions of the nation, and residents of communities 
of four different population sizes. (Appendix B defines 
region and size of community.) Our concern is to describe 
how well these groups of respondents do on science exercises. 

In the interests of clarity and ease, we want to make as few 
numerical comparisons as will adequately describe what we 
are to look at. When there are only two groups, as with sex 
divided into male and female, one numerical comparison will 
suffice. We need only compare one group with the other. 

For sex we will use the difference 




For example, one exercise given at age 9 shows 76% of the 
males and 68% of the females responding successfully. Using 
the procedure above, 

Sex = 76% - 68% = 8% . 

Difference 

When there are more than two groups, such differences are 
not as simple to use. With four groups, such as four regions 
or four community sizes, simplicity is gained by comparing 
the percentage of success of respondents in each group with 
that in the nation as a whole. We shall call the number thus 
obtained an effect . One example would be 



[ Northeast 


1 _ 


1 Northeast”! 


.1 


National 1 


{ Effect 


1 " : 


1 % Success | 


J 


% Success | 



For one exercise which 66% of 9-year— olds in the Northeast 
and 55% of all 9-year-olds answered successfully, the regional 
effect is 



Northeast 

Effect 



66 % 



55% = 11% • 
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What Should Be Described? 



For people of the same age, when we compare the performance 
of a group — in a geographic region, or from a size of com- 
munity — with all people of that age in the country, the 
results are different for each exercise — different, but not 
too different. Even though some exercises show an advantage 
for the group, others a deficit, there is often a clear 
tendency for the exercises that we are considering as a whole 
to lean one way or the other. We describe this tendency by 
reporting the typical behavior of exercises. 

Since not all exercises show the same advantage or deficit 
for the group, description of the typical behavior of all 
exercises in the class is usually not enough. We can learn 
something more from studying individual exercises for which 
the group advantage or deficit differs substantially from 
this typical behavior, if we use an appropriate rule to 
identify these a typica 1 exercises, it will be worthwhile to 
mention each of them individually, telling something of 
their nature and reporting what advantage or deficit each 
shows when the group is compared with the whole country. 

This concentration on atypical exercises is particularly 
helpful because exercises showing close to typical behavior 
are often lacking in distinctive flavor. Atypical exercises, 
by contrast, often tell us something. 



When we deal with the two sexes, we do the same things, 
except we shall compare males with females, rather than a 
group with the whole country, both in terms of typical be- 
havior and in terms of atypical exercises. 

Classes of Exercises 

Another approach to the discussion of our results is possible 
because the science exercises may be separated into classes 
by some common feature. We may investigate the behavior of 
the exercises for each class, and then ask how this behavior 
changes from one class to another. 

Only a few separations into classes are considered in this 
report. One of these is provided by content. Most exercises 
call on knowledge of either physical or biological science. 



Another classification is provided by the four science ob 
jectives :* 



1. Know fundamental facts and principles of 
science ? 

2. Possess the abilities and skills needed 
to engage in the processes of science; 

3. Understand the investigative nature of 
science ; 

4. Have attitudes about and appreciations 
of scientists, science, and the conse- 
quences of science that stem from ade- 
quate understandings. 

Although other ways of classifying the exercises might be at 
least as useful in describing sex differences, regional ef- 
fects, or community size effects, we have not found them. 

What Shall Be Typical ? 

In the interest of simplicity and clarity, we choose to sum- 
marize the behavior of a set of exercises by the median (or 
midpoint) of the differences from national results. The 
median is a summary value such that an equal number of the 
values summarized lie on either side of it. For the adult 
sample— for instance, as in Exhibit 2—1 on page 6 ——there 
were 119 exercises, for 97 of which men were more often 
right than women. There were 59 exercises where the sex 
difference favored men by more than 9.7%, and 59 where the 
sex difference was less favorable to men. (Of these latter 
59 differences/ 37 favored men by less than 9.7% and 22 
favored women.) Because equal numbers were more and less 
favorable to men, we call 9.7% the median difference and use 
it to summarize all 119 differences. 

The median is an effective typical value because (1) it is 
easy to understand and (2) it is little affected by the pres 
ence or absence of unusual values . 



* Norris, Eleanor L. (Ed.) Science obiectives . Ann Arbor, 
Michigan: Committee on Assessing the Progress of Education, 

1969. 
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Choosing Cutoffs 



Only a sample of each age group responded to each exercise, 
yet National Assessment results are concerned with whole 
populations — sometimes with all 9-year-olds in the country, 
sometimes with all 13-year-old boys, sometimes with all 17- 
year-olds in the Northeast, sometimes with all adults in 
small cities, and so on. Thus there must always be a step 
from sample to population in interpreting National Assess- 
ment results, allowing for the inescapable small differences 
between sample results and population facts. This inevitably 
raises statistical considerations — considerations which 
ought to guide our approaches, our exposition, and our state- 
ments without interfering excessively with the reporting and 
discussion of results. 

The samples taken in National Assessment were well designed 
scientific probability samples. As a result, it would be 
possible to assess how large the differences between sample 
value and population value are likely to have been for almost 
any quantity we chose to calculate from the results — for 
example, a sex difference. Results of such calculations 
have guided the choices in this report of how large a depar- 
ture from typical (median) behavior qualifies an exercise 
for separate consideration. Appendix A discusses consid- 
erations involved in, and the exact nature of, this choice. 

The chosen rule sets cutoffs determining which comparisons 
we call atypical . The cutoffs found by applying the chosen 
rule are shown by tv/o arrows on each picture of the distri- 
bution of comparisons. Exercises corresponding to compar- 
isons outside these cutoffs are discussed individually. 



MALE-FEMALE DIFFERENCES 



National Assessment was designed to allow comparisons of per- 
formance among a number of different population groups. For 
example, at every age level the performance of males on the 
science exercises may be compared with that of female's. Do 
boys and girls perform equally well on science exercises? If 
not, do differences appear for all science exercises, or only 
for certain kinds of exercises? 

To find answers to these questions, we first consider, for 
every science exercise, the difference between the percentage 
of success for males and that for females. 

The distributions of differences for all exercises, ac ages 9, 
13, 17, and adult, are presented in Exhibit 2-1. In this ex- 
hibit the length of each bar represents the number of science 
exercises that show a given difference between percentage 
correct for males and females. A difference of plus 7% indi- 
cates that 7% more males than females got the correct answer. 

A difference of minus 7 % means that 7 % fewer males than females 
got the correct answer. 

A striking feature of the distributions in Exhibit 2-1 is the 
large spread of male-female differences for adults, ranging 
from one exercise on which the correct answer is given by 35% 
more men than women to another exercise on which the difference 
is in favor of women by 25%. In sharp contrast# age 9 sex 
differences spread much less, ranging from an exercise on 
which boys do 8% better than girls to one where girls do 5% 
better than boys. 

For adults# it is clear that performance on a number of science 
exercises is decisively better for males than for females# 
while for a number of other science exercises# females perform 
much better than males. At ages 13 and 17# the range of sex 
differences is considerably smaller than for adults but still 
slightly greater than at age 9. 

The median male— female difference for all science exercises is 
shown on Exhibit 2-1 for each age level. These medians show 
a progressive increase with age — 0.5% at age 9# 1.7% at age 
13# 3.0% at age 17# and 9.7% for adults. With increasing age# 
then# there is an increasing tendency for males to perform 
better than females on science exercises. 



Male % correct less female % correct 



Exhibit 2-1 




Male-female differences in percentage correct 
for all science exercises at four age levels 



Age 

9 13 



30 • 



20 




rd 

Pm 



-20 • 



-30 J 




Adult 




No. of 

Exercises 145 

Median 0 . 5% 
difference 



* 



122 124 

1 . 7 % 3 . 0 % 



15 



6 



119 

9 . 7 % 



30 



20 



10 



0 - 



-10 



- 20 - 



-30 J 



As noted in Chapter 1, the arrows in Exhibit 2-1 show the cut- 
ting points beyond which exercises are worth individual discus- 
sion as atypical exercises. These exercises are discussed in- 
dividually in Chapter 5, where we find that the exercises on 
which male performance is a typically superior are, without 
exception, physical science exercises. For exercises assessing 
knowledge of biological science, the typical male-female dif- 
ference is small. On some biological science exercises, per- 
formance of females is better than that for males. 

The relation between age and the male— female difference on 
typical physical or biological science exercises is shown in 
Exhibit 2-2. For biological science exercises, the median sex 
difference remains small at all four age levels, rising from 
0.1% at age 9 to 2.3% for adults. The median sex difference 
for physical science exercises, on the other hand, rises from 
a modest 0.8% at age 9 to 14.2% for young adults. These results 
suggest that the increasing tendency with age for males to per- 
form better than females on all science exercises is largely 
accounted for by the exercises in physical science. At every 
age studied, males and females perform more nearly alike on 
exercises that assess knowledge of biological science. 

Among the biological science exercises at each age, one or more 
are designed to assess knowledge of human birth or reproduction. 
For example, one exercise (No. 101 at age 9 and No. 201 at age 
13) asks "Where does a human baby come from?" Another exercise 
(No. 330 at age 17 and No. 416 for adults) asks "On the average, 
in human females, the egg is released how many days after men- 
struation begins?" Also appearing at age 17 (Exercise No. 325) 
and at the adult level (Exercise No. 420) is the question "What 
is the function of the placenta in a pregnant human female?" 

For these and other similar but unreleased exercises, it is of 
interest to look at male-female performance differences. 

Exhibit 2-3 displays these male-female differences. 
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Median male-female difference 
in % correct 



Exhibit 2-2 



Median male-female differences for physical and 
biological science exercises at four age levels 



Age 

9 13 17 Adult 





17 



8 



Exhibit 2-3 



Male-female difference in percent correct for every exercise 
relating to human birth or reproduction (Negative differences 

imply females more often correct ) 



Age 



9 

(1 exercise) 


13 

(2 exercises) 


17 

(4 exercises) 


Adult 

(6 exercises) 


0% 


2% 


5% 


- 2 % 




0% 


0% 


- 2 % 






-3% 


- 4% 






-8% 


-13% 








-15% 








-25% 



The exhibit shows that for two exercises, one at age 13 and one 
at age 17, boys get the correct answer more often than girls. 

For three exercises, one at 9, one at 13, and one at 17, there 
is no sex difference in performance. For eight exercises, two 
at 17 and six at the adult level, the percentage correct for 
females is higher than that for males. It is noteworthy that 
all of the adult exercises relating to human reproduction show 
a performance difference in favor of women. Even on exercises 
referring to elements of the reproductive system in the human 
male, women give the correct answer more often than men. 

Male-female differences can also be analyzed in terms of the 
objectives the exercises were designed to assess. When this 
is done, we find that exercises for Objective 1 (Know fundamental 
facts and principles of science) and Objective 2 (Possess the 
abilities and skills needed to engage in the processes of science) 
yield median sex differences indicating a male advantage at all 
four ages. At every age, the difference in favor of males is 
at least slightly greater for the typical exercise that assesses 
Objective 2 than for the typical exercise that assesses Objective 

1 . 
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REGIONAL EFFECTS 



To what extent are the patterns of correct responses to the 
science exercises consistent from region to region of the country 
and to what extent are there patterns of response that charac- 
terize some regions and not others? Are the observed regional 
patterns consistent from one age level to another, or are there 
distinctive age patterns? Are regional differences, if they 
exist, limited to certain exercises, or are they found for all 
exercises? 

To find answers to these questions, we examine the data from 
several perspectives. As indicated in Chapter 1, a regional 
effect is the difference between the percentage correct for that 
region and the national percentage correct. If the percentage 
correct is greater for the region, we refer to regional advantage ; 
if the percentage correct is smaller for the region, we have a 
regional deficit. 

In our study of regional differences, we begin by looking at the 
distribution of the effects for all exercises at each age level 
and noting the median. Next we identify exercises for which 
the regional effects are atypical and examine them for distinctive 
characteristics. Then we consider classes of exercises as de- 
fined by science objectives and as defined by science content 
(physical science versus biological science). 

Details of the comparisons are reported in Chapter 6. In this 
chapter we concentrate on the major findings, taking the regions 
in the order Northeast, Southeast, Central, and West and then 
discussing interrelationships. 

Northeast 



The distributions of Northeast effects are presented in Exhibit 
3-1 for the four age levels. The medians of the distributions 
for ages 9, 13, and 17 are close to 2% (2.2%, 2.0%, and 2.4%). 

For these three ages, the percentage correct in the Northeast 
is about 2% higher than for the country as a whole. The median 
for the adult sample is very near zero, and there is no clear 
evidence that adults in the Northeast perform any better or worse 
than adults in the total sample on the group of exercises as a 
whole . 
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Northeast % correct less national % 



Exhibit 3-1 



Distributions of Northeast regional effects 
for all science exercises at four age levels 



Age 



9 13 17 Adult 
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No. of 
Exercises 



122 



124 



119 



Median 2 . 2 % 



2 . 0 % 



2.4 % 



0 . 7% 




20 



li 



While the differences at each age level in the Northeast are 
spread out around the medians, most of the spread is attrib- 
utable to sampling error, much more so than for the sex differ- 
ence distributions of Chapter 2. For each age level, however, 
there are some differences that cannot be accounted for by 
sampling error alone. The arrows in Exhibit 3-1 show the 
boundaries beyond which exercises are considered atypical. 
(Considerations involved in identifying typical and atypical 
exercises are discussed in Chapter 1 and Appendix A.) These 
atypical exercises are examined individually in Chapter 6, 
where we find no observable pattern except for a tendency for 
the Northeast to have a special advantage on exercises involving 
the abilities and skills needed to engage in the processes of 
science. (See the section on "classes of exercises" below.) 

Southeast 



Distributions of the Southeast effects for all exercises are 
presented in Exhibit 3-2 for the four age levels. Again the 
spreads are only somewhat greater than can be expected from 
sampling error. Whereas the median effects were all positive 
for the Northeast (about 2% at ages 9, 13, and 17), all of the 
medians for the Southeast are negative. For each age the 
Southeast performance is about 5% below that of the country as 
a whole. 

When exercises exhibiting atypical effects are examined age 
level by age level (see Chapter 6), no pattern can be seen. 
When atypical exercises are collected across age levels, how- 
ever, a pattern begins to emerge. This pattern involves the 
science objectives and is discussed below (see page 18). 

Central 



Distributions of the regional effects for the Central region 
are pictured in Exhibit 3-3. Most of the effects fall between 
the arrows marking the limits beyond which exercises are 
atypical, and the distributions are generally symmetrical. At 
age 9 and age 13 the medians are 1.6% and 1.9% respectively. 
These are not large effects, but they are larger than one would 
expect to find simply as a result of sampling fluctuations. 

At age 9, 72% of the exercises show an advantage for the Central 
region? at age 13 the Central region has an advantage on 80% 
of the exercises. 
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Exhibit 3-2 



Distributions of Southeast regional effects 
for all science exercises at four age levels 



Age 




No. of 

Exercises 145 
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Median -5.1% 
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Exhibit 3-3 

Distributions of Central regional effects 
for all science exercises at four age levels 
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For 17-year-olds and adults the picture is different. At age 
17, the median is zero and the exercises are split evenly 
between those showing an advantage and those showing a deficit. 
The median for adults (0.6%) is barely positive; there is 
insufficient evidence to conclude that there is a general advan- 
tage for this group. 

When the atypical exercises are examined (Chapter 6) , no dis- 
tinctive patterns are found. 

West 



The distributions of effects for the West are presented in 
Exhibit 3-4. At ages 17 and adult the West performs about 2% 
better than the country as a whole (1.7% and 2.6%) , while ages 
9 and 13 exhibit insignificant median effects (0.0% and -0.6%). 

A detailed examination of the 17 exercises showing atypical 
effects (Chapter 6) reveals no pattern. 

Classes of Exercises 



As indicated in Chapter 1 (page 3 ), the exercises we r 3 written 
to assess four objectives. Since only a few exercises are 
assigned to Objectives 3 and 4, the most illuminating analysis 
comes from comparing the median effects of exercises classed 
under Objectives 1 and 2. Exhibit 3-5 presents the median 
effects for these two objectives and for all exercises regard- 
less of objectives. The most striking aspect of the comparison 
of Objective 1 and 2 effects is that the effects for exercises 
classed under Objective 2 tend to be a little more extreme than 
those for Objective 1 exercises. For most combinations of region 
and age, as Exhibit 3-5 shows, the median effect for Objective 
2 exercises is about 1% larger than the median effect for Objec- 
tive 1 exercises, and both medians are in the same direction 
from the national percentage of success. For example, the 
median effect for Objective 2 exercises for West adults (2.9%) 
is 0.6% larger than the median effect for Objective 1 exercises 
(2.3%). In both cases, the effect is positive, indicating that 
West adults perform better than adults for the country as a 
whole . 

There are several exceptions to this pattern. The behavior of 
Northeast adults and West 9-year-olds is unusual in that their 
Objective 2 medians are in the opposite direction from the medians 
of Objective 1 exercises. Effects for Southeast adults are also 
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Exhibit 3-4 



Distributions of West regional effects 
for all science exercises at four age levels 
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Exhibit 3-5 



Median effects for all exercises, Objective 1 
exercises only, and Objective 2 exercises only 



Medians for 



Groun 


Age 


All Ex. 


Obj . 1 


Obj. 2 


Compar is on* 


West 


Adult 


2.6 


2.3 


2.9 


0.6 bigger 


Northeast 


17 


2.4 


2.1 


3.4 


1.3 bigger 


Northeast 
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2.2 


1.9 


2.7 


0.8 bigger 


Northeast 


13 


2.0 


1.5 


3.2 


1.7 bigger 


Central 


13 


1.9 


2.1 


2.3 


0.2 bigger 


West 


17 


1.7 


1.1 


2.6 


1.5 bigger 


Central 
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1.6 


1.3 


2.7 


1.4 bigger 


Northeast 


Adult 


0.7 


1.0 


-1.1 


2.1 across zero** 


Central 


Adult 


0.6 


0.1 


2.1 


2 . 0 bigger 


Central 


17 


0.0 


-0.3 


0.2 


0.5 across zero** 


West 


9 


0.0 


0.6 


-1.3 


1.9 across zero** 


West 


13 


-0.6 


-0.6 


-0.6 


no change** 


Southeast 


13 


-4.8 


-4.5 


-7.7 


3.2 bigger 


Southeast 


17 


-4.8 


-4.2 


-7.3 


3.1 bigger 


Southeast 


Adult 


-5.1 


-5.5 


-4.5 


1.0 smaller** 


Southeast 


9 


-5.1 


-5.1 


-5.6 


0.5 bigger 



♦Objective 2 compared to Objective 1 
♦♦Exceptions to the rule 
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an exception in that the median is more extreme for Objective 
1 exercises than for Objective 2. 

While Objective 2 deviations are generally about 1 % greater than 
those of Objective 1, there are two combinations of region and 
age where the deviation is much more conspicuous. 

For Southeast 13s and 17s, Objective 2 deviations are more than 
3 % greater. In light of this finding, the pattern observed in 
the atypical exercises across all ages for the Southeast appears 
to be a reflection primarily of the characteristics of atypical 
exercises at ages 13 and 17. The pattern consists of two parts: 

1. The tendency for exercises classified under Objective 2 to 
exhibit atypical deficits only. 

2. The tendency for Objective 1 exercises showing atypical 
advantages to differ systematically from Objective 1 
exercises showing atypical deficits. 

Altogether at ages 13 and 17 in the Southeast there are eight 
atypical exercises classified under Objective 2 and all show 
a Southeast deficit. In contrast, there are six Objective 1 
exercises showing an atypical Southeast advantage and eight 
showing an atypical deficit. The six showing an advantage tend 
to ask for knowledge that might be obtained from general expe- 
rience, for example, knowledge about automobiles or rainbows; 
those showing a deficit tend to deal with the sorts of things 
that can be learned best through formal instruction, for 
example, the size and motion of the planets. 

When exercises are sorted according to physical versus biologi- 
cal science, the regional effects are not altered enough to 
deserve mention* here. (Details are presented in Chapter 6.) 

Interrelationships 

The relationships among the median effects for the four regions 
at each of the four ages are pictured in Exhibit 3-6. At all 
four ages, there is a decided contrast between the lower per- 
formance of the Southeast and the average or above average per- 
formance of the other three regions. The Northeast and Central 
regions perform a little higher than the country as a whole 
at ages 9 and 13. At age 17 the Northeast and the West perform 
a little better. Only the West is higher at the adult level. 



Exhibit 3-6 



Median effects for the four regions at four ages 
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The marked contrast between Southeast performance and perfor- 
mance of the other three regions leads to an interesting question. 
Could the Southeast's lower performance on science exercises 
be attributed to a lack of formal instruction in science? On 
certain exercises that might be answered on the basis of out- 
of-school experience, the Southeast performs as well as or better 
than the rest of the country. But on exercises requiring sci- 
entific skills or knowledge of the abstractions and generaliza- 
tions taught in formal courses. Southeast performance, partic- 
ularly that of the 13s and 17s, is substantially lower. 

Patterns among the other three regions, while based on smaller 
differences, also bring questions to mind. Why, for example, 
is the Northeast effect for adults lower than for 9s, 13s, or 
17s? Could the difference be the result of migration of young 
adults into or out of the Northeast? Why is the performance 
of adults in the West superior to any other region and to the 
country as a whole? Could West adult superiority also reflect 
primarily a movement of people rather than factors indigenous 
to the region? 

At this stage of the assessment project, it is not possible to 
say what accounts for the patterns across regions. One can only 
look at the first year's data and raise questions. These ques- 
tions, however, can suggest additional kinds of data that should 
be collected in later phases of the project. As such data are 
gathered and related to what is already known, the picture may 
become clearer. 
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4. 



SIZE-OF-COMMUNITY EFFECTS 



One aim of National Assessment has been to trace educational V. 

progress in different types of communities. Do children •■r' 

from schools in the big cities of the nation perform better 
or worse than those from small towns or rural areas? What 
can be said about progress on science content at various 
ages in the inner cities and in the suburbs? 

This chapter is based on a size-of-community classification 
of four groups: 

Big Cities: Cities with populations over 200,000. 

Urban Fringe: Areas surrounding Big Cities. 

Medium-Size Cities: Cities with populations between 

20,000 and 200,000. 

Smaller Places: Towns with populations under 20,000 

and rural places. 

Conveniently, there are about the same number of people in 
each group. No distinction is made within Urban Fringe 
areas between affluent residential suburbs and heavily in- 
dustrialized fringe zones, nor is there separate examination 
within the Big Cities of the most economically disadvantaged 
"inner city" areas. A more elaborate classification proce- 
dure would be necessary to realize these distinctions. 

Results for some of these classifications will be presented 
in a later report. 

The performances of each of the four size-of-community (S0C) 
groups are considered in turn. The standard measure of 
relative performance on each exercise is, as in Chapter 3, 
the difference between the percentage of correct answers 
within a given group and the national percentage of correct 
answers. By summarizing over all the exercises (or certain 
subsets of exercises), we obtain a picture of the general 
performance level of each SOC group. 
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Big Cities 



The distributions of Big City effects for all exercises at 
all age levels are shown in Exhibit 4—1. All four of the 
distributions have their medians below zero, by 4.8%, 4.7%, 
2.6%, and 2.4% at ages 9, 13, 17, and young adult (26-35). 

It is clear that this Big City deficit of a few percentage 
points is statistically reliable: at age 9, there is a def- 

icit on 134 of 145 exercises (92%); at age 13, on 112 of 122 
exercises (92%) ; at age 17, on 95 of 124 (77%) ; for adults, 
on 83 of 119 (70%). The deficit appears somewhat less at 
the two upper age levels. Application of an appropriate 
statistical test indicates that this appearance is reliable, 
that there is indeed a reduced deficit in Big City perfor- 
mance from age 9 and 13 to age 17 and adult. This reduced 
deficit appears even if we look only at comparisons involving 
exercises administered both at 9 or 13 and at 17 or adu.'t. 

Urban Fringe 

The distributions of Urban Fringe effects for all exercises 
are shown in Exhibit 4—2. All four of the distributions 
have their medians above zero, by 3.0%, 3.4%, 2.4%, and 3.2% 
at the respective age levels. 

An Urban Fringe advantage appears for most exercises, about 
equally strongly at each age: at age 9, on 126 of 145 exer- 
cises (87%) ? at age 13, on 104 of 122 (85%) ; at age 17, on 
97 of 124 (78%) ? for adults, on 99 of 119 (83%) . 

Medium-Size Cities 



The distributions of Medium-Size City effects are shown in 
Exhibit 4-3. All four of the distributions have their medi- 
ans barely above zero, by 0.8%, 1.1%, 0.8%, and 0.4%. This 
slight but consistent tendency for Medium— Size City perfor- 
mance to be above the national average is mirrored also in 
the counts of individual exercises: 96 of 145 at age 9 (66%); 

82 of 122 at age 13 (67%) ; 83 of 124 at age 17 (67%) ; 66 of 
119 for adults (55%) . The modest dip in percentage at the adult 
level could come from the wider scatter of exercise effects 
due to the smaller adult sample sizes. 

Smaller Places 



The distributions of effects for Smaller Places appear in 
Exhibit 4-4. The four distributions all have medians below 
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Exhibit 4-1 



Distributions of Big City effects 
for all science exercises at four age levels 
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Exhibit 4-2 



Distributions of Urban Fringe effects 
for all science exercises at four age levels 
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145 
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Median 3.0% 
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Exhibit 4-3 



Distributions of Medium-Size City effects 
for all science exercises at four age levels 
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Exhibit 4-4 



Distributions of Smaller Place effects 
for all science exercises at four age levels 
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zero, by 1.2%, 1.1%, 2.1%, and 2.8% respectively. The pro- 
portions of exercises displaying a deficit are : 97 of 145 

at age 9 (67%) ? 78 of 122 at age 13 (64%) ; 90 of 124 at 

age 17 (73%) ; 98 of 119 for adults (82%) . The apparent 
worsening of Smaller Places' performance at the two upper 
levels, for adults especially, is of borderline statistical 
reliability. If this trend, though slight in terms of shifts 
in median performance, is taken to be a real effect, then one 
should note that it is the opposite of the Big City age trend. 
While Big Cities' performance improves by about 2% at ages 17 
and adult over ages 9 and 13, Smaller Places' performance per- 
haps gets worse, by 1% or 2%. 

Other Results 



Detailed discussion of size-of -community effects for different 
individual exercises and groups of exercises is deferred to 
Chapter 7. Examination of exercises producing atypical effects 
for each SOC group does not yield clear findings, although a 
few speculations are also given in Chapter 7 . 

The sole result worthy of report in the present overview con- 
cerns the differences for exercises designed for science 
Objectives 1 and 2. For size -of -community groups, as for 
regional groups, it appears that the group effects are sharper 
for Objective 2 exercises then for Objective 1 exercises. In 
particular, this is true for the general Big City deficit and 
the general Urban Fringe advantage. 



Exhibit 4-5 displays the generally sharper effects for Objec - 
tive 2 for all sixteen SOC-by-age combinations, arranged in 
order of strength of the amount of sharpening. Wc note that 
this phenomenon is strongest for the adult age group, weaker 
for 13- and 17-year-olds, and non-existent for 9-year-olds. 
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Exhibit 4-5 



Size-of -community median effects for exercises 
under objectives 1 and 2 

Medians for Amount by which Obj . 2 

SOC Group Age Obj . 1 Obj . 2 effect is more extreme 



(Arranged in < 


order of 


differences 


in effect 


between Objectives) 


Medium City 


Adult 


0.3 
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2 . 0 more 


Big City 


Adult 
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Smaller Place 
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1.6 more 
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17 
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-1.9 


equal 


Medium City 


17 


1.0 


0.9 
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Medium City 
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0.1 
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5. 



MALE-FEMALE DIFFERENCES IN DETAIL 



Chapter 2 presented the major findings from an analysis of male- 
female performance differences on the science exercises at the 
four age levels. In this chapter, we consider in somewhat great 
er detail the nature of atypical exercises at each age, and also 
we present results of an analysis of exercises in terms of the 
science objectives that they were designed to assess and of 
whether they deal with physical or biological science. The text 
of all released exercises may be found in NAEP Report Number 1. 

Age 9 



At age 9, the median difference in performance between boys and 
girls over all 145 science exercises is 0.5%. This is negli- 
gibly different from 0%. Also, as seen in Exhibit 2-1 on page 
6 , the boy-girl differences are approximately symmetric about 
zero. We recognize that such differences are subject to sampling 
error, and that nonzero differences may be the consequence of 
sampling fluctuations. Thus, even with no general tendency for 
boys to perform better or worse than girls, some observed dif- 
ferences would be positive and some would be negative, clustered 
more or less tightly around zero. The actual distribution for 
age 9 in Exhibit 2-1 closely resembles the distribution to be 
expected if there were no systematic sex differences in perfor- 
mance. 

Nevertheless, one science exercise at age 9 is atypical, in the 
sense that the boy-girl difference in performance is more distinct 
from the median difference for all exercises than we expect, 
based solely upon sampling variability. This is released Exer- 
cise No. 120, on which 76% of the boys give the right answer, 
compared with 68% of the girls. The exercise requires knowledge 
that a moving block striking a stationary block from the left 
will cause the latter to move to the right. 

Age 13 

At age 13, the median difference between boys and girls, over 
all 122 science exercises, is 1.7%. For 90 of these exercises, 

74% of the total number, boys perform better than girls. 

From Exhibit 2-1, it may be seen that performance of boys exceeds 
that of girls by at least 10%, the upper cutting point for male- 
female differences, for three exercises. Of these three exer- 
cises, one has been released (No. 214) and two remain unreleased 

ERJC 38 29 



so that they may be administered again to 13-year-olds when 
science is reassessed. Exercise No. 214 asks where a two-pound 
weight must be hung on the left side of a (pictured) beam to 
make the beam "balance, when a four -pound weight has been 
hung on the right side of the beam. Among boys, 61% give 
the correct response, while 48% of the girls answer correctly. 

Both unreleased exercises for which the observed sex difference 
is at least 10% also are apparatus exercises; respondents are 
to find the correct answers by performing experiments in elemen- 
tary physics. At age 13, then, exercises on which boys most 
clearly perform better than girls are exercises which pertain 
to simple experiments in physics. 

Another four exercises at age 13, all unreleased, are atypical, 
showing about 6% (the lower cutting point for differences) high- 
er performance by girls than boys. Analysis of the content of 
these four exercises provides no obvious clue as to why they 
are distinguished from the typical science exercise by the high- 
er performance of girls. Perhaps these exercises only appear 
atypical due to unusually large sampling error. 

Age 17 

For 17-year-olds, the distribution of sex differences over the 
124 science exercises is similar to that for 13-year-olds (Ex- 
hibit 2-1). The median difference is higher at age 17, 3.0%, 
and for a greater proportion of exercises (78%) boys perform 
better than girls. 

For nine exercises the difference between percent success for 
boys and girls is at least 11%. Two of the exercises that show 
a male-female difference of at least 11% were released and 
seven remain unreleased for readministration. The released 
exercises are: No. 320, which asks how long it will take for 

a boat travelling at 5 m.p.h. down a river which flows at 5 
m.p.h. to reach a point 10 miles downstream (males, 61% correct; 
females, 47% correct) ; and No. 341, an apparatus item requiring 
placing a weight on a beam so as to balance the beam (males, 

82% correct; females, 68% correct) . 

All seven unreleased exercises that display male-female dif- 
ferences of at least 11% at age 17 involve tasks similar to 
those of the released exercises in that the correct answer is 
more likely to be known by a person with knowledge of physics. 
Four of them require the completion of a simple physics experi- 
ment. As at age 13, then, those exercises on which 17-year-old 
males outperform females are physical science exercises. 
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At age 17 , one exercise atypically displays a sizable difference 
in favor of females. For this unreleased exercise, which taps 
knowledge of the reproductive system in the human female, 39% 
of the 17-year-old males and 47% of the females give the correct 
answer. 

Adults 



As noted in Chapter 2, many more exercises at the adult level 
show atypical sex differences than do exercises at the younger 
ages. The median difference in percent success for males versus 
females is 9.7%. For 82% of the 119 science exercises, male 
performance exceeded performance of females. However, several 
science exercises at the adult level are associated with a much 
higher level of success for women than for men. 

The total number of adults to whom a given exercise was admin- 
istered is smaller than the comparable numbers for 9-, 13-, or 
17-year-olds. Typically, an exercise was administered to more 
than 2,000 respondents at the younger ages, but only to about 
850 adults. Consequently, at the adult level a larger observed 
male-female difference is required in order that we be confident 
that the observed difference represents other than a sampling 
departure from the overall median difference. At the adult level, 
we give special attention only to exercises for which the observed 
male-female difference is in favor of males by 21% or more, or 
in favor of females by at least 2%. 

For 17 of the adult science exercises, at least 21% more males 
than females gave a correct answer. For another 17 exercises, 
performance of females was at least 2% better than performance 
of males. 

Of the 17 exercises for which males show a decisive advantage, 
six were released and are listed by exercise number in Exhibit 
5-1. Of the 17 exercises for which females performed at least 
2% better than males, 10 were released, and also are listed in 
Exhibit 5-1. 

As for exercises at ages 13 and 17, the adult exercises for which 
success of men is most decisively superior to success of women 
are those assessing knowledge of electronics and physics. Exer- 
cise No. 413, for which 33% more males than females get the right 
answer, requires knowledge that most of the chemical energy of 
gasoline burned in a car is converted to heat. Exercise No. 408, 
with a male-female difference of 32%, is the same as Exercise 
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Exhibit 5-1 





Released 


exercises at 
atypical sex 


the adult level 
differences 


wi th 






% Correct 
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No. 


Males 


Females 


% Difference 


413 




77 


44 


33 


408 




81 


49 


32 


415 




71 


40 


31 


410 




76 


50 


26 


438 




83 


61 


22 


43 0 




33 


11 


22 


407 




67 


69 


-2 


412 




58 


61 


-3 


402 




89 


93 


-4 


404 




77 


81 


-4 


405 




69 


73 


-4 


406 




67 


72 


-5 


441 




43 


54 


-11 


416 




49 


62 


-13 


418 




41 


62 


-21 


420 




31 


56 


-25 
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No. 323 at age 17, and asks the purpose of a fuse in an elec- 
trical circuit. Exercise No. 415, with a sex difference of 
31%, is identical to Exercise No. 320, administered at age 17. 

One must determine the time it will take a motorboat travelling 
5 m.p.h. on a river flowing 5 m.p.h. to reach a point 10 miles 
downstream. Exercise No. 410 (26% difference) is correctly 
answered by indicating that an electric current involves the 
movement of electrons in a copper wire. Exercise No. 438 (22%) 
is the same as Exercise No. 341 at age 17, an apparatus exer- 
cise that requires placing a weight on a balance beam so as to 
balance the beam. Exercise No. 430 requires knowledge that, if 
two light waves are travelling in a vacuum, the wave with the 
higher frequency will have the shorter wavelength (22% male- 
female difference). 

The exercises that are atypical in favor of women are very dif- 
ferent from these physical science exercises. Exercise No. 420 
asks the function of the placenta in a pregnant human female 
(to carry nourishment to the baby) ? 56% of females give the 

right answer, compared with 31% of males. Exercise No. 416 
requires knowledge that, on the average, an egg is released 
in human females 14 days after menstruation begins? the sex 
difference is 13% in favor of females. 

On Exercise No. 406, 5% more women than men state that adrena- 
line acts as a stimulant to the heart. Women perform better 
than men by 4% on an exercise (No. 405) assessing knowledge of 
the effects of sterilization in human males, and on an exercise 
(No. 404) requiring knowledge that whooping cough cannot be in- 
herited. 

Women outperform men by 4% in displaying knowledge that the sex 
of a human baby is determined by chromosomes (Exercise No. 402 ) y 
and by 3% in indicating that transplant surgery might be most 
successful if the donor is an identical twin (No. 412) . More 
women than men (2% more) also recognized that in mammals, sperm 
is produced by the testes (No. 407). 

Both of the remaining exercises (No. 441 and No. 418) on which 
women did better than men offer special problems of interpretation. 
For Exercise No. 441, an apparatus exercise administered to young 
adults, performance of women (54%) is almost 11% higher than 
that for men (43%). The apparatus consists of a pendulum — a 
weight on the end of a string. Given a watch with sweep second 
hand, the task is to find how long it takes for the weight to 
swing back and forth 10 times. When a respondent interpreted 



"back and forth" as a swing only to the left or to the right, 
something that was not uncommon, the answer was scored incorrect. 
When this same task was administered to 17-year-olds (Exercise 
No. 344), and to 13-year-olds (Exercise No. 237), boys performed 
better than girls, by 9% in each case. Misinterpretation of 
"back and forth" may have played a substantial part in this 
anomaly. 

Exercise No. 418, on which the difference between correct re- 
sponse for women and men is 21%, also presents special problems 
of interpretation; groups that generally perform well on other 
science exercises tend to give a wrong answer to this one. The 
respondent is told that a 5-pound rock is dropped from a cliff 
500 feet high. The longer the rock falls, the greater is its: 
acceleration, potential energy, speed, total energy, or volume? 
"Acceleration" and "potential energy" are both popular answers 
among groups of respondents who perform well on other physical 
science exercises. Both of these answers are wrong. The correct 
answer is "speed, " the answer supplied by 62% of women and by 
41% of men. 

The content of the unreleased adult exercises that display atypi- 
cal male-female differences resembles that for the released ex- 
ercises. All 10 of the unreleased exercises on which men succeed 
at least 21% more often than women tap knowledge of physics. Of 
the seven unreleased exercises on which women succeed at least 
2% more often than men, one asks a question concerning the fer- 
tilization of an egg in a human female, and another concerns the 
development of the human embryo; two require familiarity with 
human diseases, one refers explicitly to knowledge of recipes 
and clothing design, and the other two are more general science 
exercises, one from biology and one from astronomy. 

Physical Science versus Biological Science 

At each age level, exercises have been classified on the basis 
of their content into classes of "physical science" or "biological 
science." (Some exercises at each age assess general science 
facts, principles, or attitudes; they could not be classified 
either as physical or biological science and are not considered 
further in this section.) Comparisons of the distributions of 
male-female performance differences at each of the four age 
levels appear in Exhibits 5-2 to 5-5. 

Exhibit 5-2 shows that at age 9 sex differences are distributed 
nearly symmetrically about zero for both physical science exer- 
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Exhibit 5-2 



Male-female differences for 
physical and biological science exercises — age 9 
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Exhibit 5-3 



Male-female differences for 
physical and biological science exercises — age 13 
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Exhibit 5-4 



Male-female differences for 
physical and biological science exercises— age 17 
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Exhibit 5-5 



Male-female differences for 

physical and biological science exercises — young adults 
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No. of 
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cises and biological science exercises. This finding is con- 
sistent with the general result that 9-year-old boys and girls 
perform about equally well on the NAEP science exercises. 

Exhibit 5-3 shows the comparable distributions for age 13. Sex 
differences for physical and biological science items display 
similar distributions. However# as noted earlier# the only ex- 
ercises on which boys perform at least 9 % better turn girls are 
physical science exercises. 

Results for age 17 are shown in Exhibit 5—4. At this age# it 
is clear that all exercises on which boys perform much better 
than girls are physical science exercises. The results provide 
no evidence for any systematic sex difference on biological 
science exercises. 

For adults# the effects noted for 17-year-olds are accentuated 
(Ibchibit 5-5) . The vast majority of exercises for which men 
perform much better than women are exercises assessing knowledge 
of physical science. 

The pattern over age of medians for physical and biological 
science exercises was shown i-. Ibchibit 2-2# Chapter 2# which 
clearly shows that the advantage of males over females on 
physical science exercises increases with age. This finding 
is consistent with an expectation one might have based upon 
knowledge of different choice of science curriculum by boys and 
girls in high school. Boys and girls alike tend to be exposed 
to physical and biological science topics in elementary grades. 

In high school# science courses — physics# chemistry# biology# 
etc. — more often ara elective courses. Boys more frequently 
than girls choose to elect courses in physics and chemistry. 

The differential exposure of boys to a physical science curric- 
ulum might explain the advantage of boys and young men over girls 
and young women on the typical exercise that assesses physical 
science. 



Science Objectives, and Other Ways to Classify Exercises 

The finding of differential performance by males and females 
for physical science versus biological science exercises sug- 
gests that other ways to classify science exercises also may 
be sensitive to sex differences. One obvious way is by the 
science objective for which each exercise was designed* 
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Most of the science exercises written for each age level were 
intended to assess Objective 1. Fewer exercises were adminis- 
tered for Objective 2. (These include the apparatus exercises 
at ages 13, 17, and adult.) Still fewer exercises assessed 
Objectives 3 and 4. Exhibit 5-6 shows the number of exercises 
at each age level prepared for each objective. Exhibit 5-7 
shows the median male-female differences for each set of exer- 
cises at the four age levels. 

Such small numbers of exercises for Objectives 3 and 4 were ad- 
ministered as to make an evaluation of sex differences for those 
groups of exercises unrewarding. Larger numbers of exercises 
for Objectives 1 and 2 were administered at every age. 

At all four ages, the median male-female difference is positive 
(greater than zero) , both for exercises assessing Objective 1 
and for exercises assessing Objective 2. However, at each age, 
the median male-female difference is greater for Objective 2 
(Possess the abilities and skills needed to engage in the pro- 
cesses of science) than for Objective 1 (Know the fundamental 
facts and principles of science), although the differences be- 
tween objectives are small except at age 17. 
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Exhibit 5-6 



Number of exercises for each objective at four age levels 



Age 



Objective 


9 


13 


17 


Adult 


1 


96 


75 


89 


85 


2 


29 


31 


24 


24 


3 


11 


8 


6 


5 


4 


9 


8 


5 


5 


All exercises 


145 


122 


124 


119 



Exhibit 5-7 

Median male-female differences in percentage 
correct by objective at four age levels 



Objective j> 

1 .3 

2 1.0 

3 1.6 

4 - 1.2 



Age 

13 17 Adult 

2.0 2.6 9.4 

2.4 5.2 9.5 

-.5 .9 11.7 

.5 .2 

1.7 3.0 



All exercises 



5 



5.5 

9.7 



6 . 



REGIONAL EFFECTS IN DETAIL 



The focus of Chapter 3 was on summary and interpretation of 
the regional effects. In this chapter we provide additional 
data to support the generalizations given in Chapter 3, both 
those concerning significant regional effects and those con- 
cerning lack of significant effects. Where appropriate, refer- 
ence is made to the exhibits of Chapter 3. The text of all 
released exercises is published in NAEP Report Number 1. 

Northeast 



The distributions of regional effects for the Northeast are 
presented in Exhibit 3-1. At each age level, certain exercises 
fall so far from the median as to warrant special attention. 
Some of these have been released and can be examined by any 
reader of this report. They are listed by age level and by 
exercise number in Exhibit 6-1. The unreleased exercises ex- 
hibiting atypical effects, which are reserved for administra- 
tion again, will be described here only in a general way. 

Exhibit 6-1 

Released exercises showing atypical 
effects for the Northeast 



Age 


Exercise 

Number 


% 

Northeast 


Correct 

National 


% Difference 


9 


128 


66 


55 


11 




153 


48 


38 


10 




126 


68 


59 


9 




135 


34 


38 


-4 


13 


222 


31 


38 


-7 


17 


327 


45 


34 


11 


Adult 


420 


58 


45 


13 




441 


33 


48 


-15 
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Age 9 . At age 9 the median effect over all 145 exercises is 
2.2%. The effect is positive for 118 exercises (81%) and nega- 
tive for 27 (19%). 

For six exercises the Northeast advantage is at least 8%. Of 
these exercises, three were released (see Exhibit 6-1) and three 
remain unreleased. Exercise No. 128 calls for knowledge that 
the sun is a star rather than a planet, satellite, or solar 
system. The Northeast advantage is 11% (Northeast 66% correct, 
national 55%). Exercise No. 153 (Northeast advantage 10%) re- 
quires selection of the one thing that could not cause a pipe 
to become stopped up (the sink is full). Exercise No. 126 re- 
quires the respondent to know that most scientists think the 
center of the earth is very hot, rather than cold, dusty, or 
muddy. The Northeast advantage is 9%. There appears to ,be no 
particular pattern characterizing these exercises. Of the three 
unreleased exercises, one falls under Objective 1 and two under 
Objective 2. Two of the unreleased exercises are classified 
as physical science, one as biological science. 

There is a Northeast deficit exceeding 4% for three exercises, 
one of which has been released. Exercise No. 135 (Northeast 
34%, national 38%) requires the examinee to recognize that if 
a rock is submerged in water, the water will remain at the same 
level after the rock is broken into pieces. It is classified 
under Objective 1. The other two exercises have not been re- 
leased. Both are classified under Objective 1; one ' isesses 
knowledge of physical science, the other of biological science. 

Age 13 . At age 13 the median of the Northeast effects stands 
at 2%. For the 122 exorcises, 90 (74%) of the effects are posi- 
tive and 32 (26%) are negative. For only two of the exercises 
is the Northeast advantage as great as 9%, the value marking 
a positive atypical effect. Neither has been released. Both 
are laboratory exercises in pnysical science. 

Only one exercise (No. 222) exhibits a Northeast deficit in 
excess of 5%. It asks the respondent to identify the explana- 
tion for why giraffes have long necks that is consistent with 
the theory of natural selection. 

Age 17 . The overall Northeast effect at age 17 is represented 
by the median of 2.4%. The difference is positive for 93 (75%) 
of the 124 exercises and negative for 31 (25%) of them. Of the 
three exercises with an advantage for the Northeast as great 
as 10%, one (No. 327) was released. It asks how sound is car- 
ried through the air. Of the two unreleased exercises, one 
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calls for specific information related to physical science, and 
the other is a laboratory exercise. For the latter, the esti- 
mated error is so large as to suggest that the atypical effect 
is quite likely due to sampling variability. 

No exercise exhibits a Northeast deficit large enough to be con- 
sidered atypical. 

Adults . The median of the distribution of differences for adults 
is 0.7%, a difference so small that one feels safe in concluding 
that the adults in the Northeast perform about the same as the 
national sample when all exercises are considered. Of the 119 
exercises, 62 are easier for the Northeast and 57 are easier 
for the national sample. 

When the small positive difference and the relatively smaller 
size of the sample at the adult level are taken into account, 
one concludes that an advantage of 11% or a deficit of 9% would 
be required before the discrepancy could be considered atypical. 
Only one exercise exhibits an atypical Northeast advantage and 
two exercises an atypical Northeast deficit. Exercise No. 420 
(Northeast advantage 13%) calls for knowledge of the function 
of the placenta. Exercise No. 441 (Northeast deficit 15%) is 
an apparatus exercise with an estimated sampling error large 
enough to account for the discrepancy. The unreleased exer- 
cise exhibiting an atypical deficit asks for physics infor- 
mation. 

All Ages . Altogether there are 18 exercises exhibiting atypical 
effects for the Northeast, nine at age 9, three at age 13, three 
at age 17, and three for adults. Twelve show an advantage for 
the Northeast and six a deficit. A majority (11) assess knowl- 
edge (Objective 1) . There is a Northeast advantage for six, 
a deficit for five. The other seven fall under Objective 2. 

Of these, two are laboratory exercises with estimated errors 
large enough to account for the atypical effect. One shows a 
Northeast advantage, the other a Northeast deficit. The other 
five all exhibit a Northeast advantage. TI.uj, there appears 
to be a tendency for the Northeast to show an atypical advantage 
on a higher proportion of Objective 2 than of Objective 1 exer- 
cises . 

Science Objectives . The finding of a Northeast atypical advantage 
on Objective 2 exercises suggests that it might be appropriate 
to look at all the exercises in terms of their classification 
by objective. (See Chapter 1 for the list of object ivesj 
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The number of exercises classified under each objective varies; 
most fall under Objective 1, somewhat fewer under Objective 2, 
and only a small number under Objectives 3 and 4. Exhibit 6-2 
presents the Northeast median effects by objective and identifies 
the number of exercises in each classification. 



Exhibit 6-2 



Median Northeast regional effects for objectives 1 and 2 and 
for all exercises (Numbers of exercises in parentheses) 

Objective Age 



1 

2 

All exercises 



9 

1.9 (96) 
2.7 (29) 
2.2 (145) 



11 

1.5 (75) 
3.2 (31) 
2.0 ( 122 ) 



17 

2.1 (89) 

3.4 (24) 

2.4 (124) 



Adult 
1.0 (85) 

-1.1 (24) 

0.7 (119) 



At ages 9, 13, and 17 the median effect is greater for Objective 
2 exercises than for Objective 1 exercises. At age 9 the dif- 
ference is 0.8%, at age 13, 1.7%, and at age 17, 1.3%. In no 
case is the difference larger than might be expected simply as 
a result of sampling fluctuations. 

At the adult level, the difference is reversed. The adult median 
for Objective 1 exercises is 1.0%. For Objective 2 exercises 
it is -1.1%. The difference could well come from sampling vari- 
ability. 

Physical Science versus Biological Science . Exercises have been 
classified on the basis of their content as well as on the basis 
of the objective being assessed. Most exercises were classified 
as either physical science or biological science; a few were 
not so classified. A summary of the data for the Northeast sample 
relative to performance on physical science and biological science 
exercises is presented in Exhibit 6-3. No systematic differences 
are discernible. 
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Exhibit 6-3 



Median Northeast regional effects for physical and biological 
science exercises (Numbers of exercises in parentheses) 

Age 



Physical science 



2.5 (78) 



11 

2.2 (67) 



17 

2.5 (78) 



Adult 
-0.2 (63) 



Biological science 1.9 (48) 



2.1 (39) 



1.8 (33) 



1.1 (44) 



Southeast 



The regional effects for the Southeast are presented in Exhibit 
3-2 of Chapter 3. Here we examine the data in more detail. 

The released exercises exhibiting atypical effects for the 
Southeast are listed in Exhibit 6-4. There are 14 of them. 

An additional 28 unreleased exercises exhibited atypical South- 
east effects. 



Exhibit 6-4 

Released exercises showing atypical effects for the Southeast 





Exercise 


% 


Correct 




Age 


Number 


Southeast 


National 


% Difference 


9 


138 


38 


35 


3 




140 


24 


14 


10 




123 


44 


67 


-23 




130 


39 


53 


-14 




156 


66 


80 


-14 


13 


217 


55 


48 


7 




227 


30 


26 


4 




247 


11 


8 


3 




233 


57 


71 


-14 




234 


49 


63 


-14 




236 


46 


61 


-15 


17 


309 


51 


66 


-15 




320 


37 


52 


-15 




353 


59 


74 


-15 
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Age 9 . At age 9 the median effect for the Southeast is a deficit 
of 5.1%. On 13 of the 145 exercises the Southeast does better 
than the country as a whole; on 132 it performs less well. 

Of the six exercises on which the Southeast performed at least 
3% better than the country as a whole, two were released. Exer- 
cise No. 138 (Southeast advantage 3%) asks about the relationship 
of a cold front to the weather behind it. Exercise No. 140 
(Southeast advantage 10%) assesses knowledge of the fact that 
coal was formed from the remains of dead plants. Of the four 
unreleased exercises, three ask for factual information (Objec- 
tive 1), the other for an understanding of the investigative 
nature of science (Objective 3). 

On six exercises the Southeast exhibits a deficit of 13% or 
more. One (Exercise No. 123) asks what scientists learn from 
studying fossils; one (Exercise No. 130) requires the knowl- 
edge that the most comfortable inside temperature is thought 
to be about 70 degrees; one (Exercise No. 156) probes aware- 
ness that natural phenomena, even though puzzling, must have a 
scientific explanation. All three of the unreleased exercises 
exhibiting a southeast deficit as large as 13% deal with physi- 
cal science content. Two exercises ask factual questions; the 
third requires the reading of a simple bar graph. 

Age 13 . The median Southeast deficit at age 13 is 4.8%. There 
is a deficit for 1C8 of the 122 exercises (89%), and only 14 
(11%) exhibit an advantage for the Southeast. 

The advantage is at least 3% for four exercises, three of which 
are released. Exercise No. 217 requires an understanding that 
when gasoline is burned in an automobile engine, most of the 
energy is released as heat (Southeast advantage 7%) . Exercise 
No. 227 assesses knowledge that the seat of intelligence in 
mammals is the cerebrum (Southeast advantage 4%). Exercise No. 
247 is directed at an attitude about science. Compared to the 
national sample, 3% more of the Southeast sample reported they 
often asked scientific questions. However, in neither case is 
the percentage large (Southeast 11% correct, national 8%) . The 
unreleased exercise showing a Southeast advantage also assesses 
an attitude about science. 

At age 13 there are 11 exercises showing a Southeast deficit of 
at least 12%, seven of which are classified under Objective 2. 

All three of the released exercises involve skills, two the 
reading of graphs and one the manipulation of laboratory equip- 
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ment. Exercise No. 233 (Southeast 57% correct, national 71%) 
requires the selection of the statement which correctly reports 
what is represented on a graph. Exercise No. 234 (Southeast 
deficit 14%) asks the student to determine where a weight of 
two grams must be placed to balance a bar with a four-gram 
weight on the other side of the pivot. Exercise No. 236 (South- 
east deficit 15%) asks for an inference about the feeding 
schedule of a dog from an examination of a graph of weight over 
time. Of the unreleased exercises, four resemble the released: 
two call for interpretation of a graph or table, one for manip- 
ulating a balance beam, and one for conducting an experiment 
and graphing the data. Of the four remaining, two ask for 
physical science information, one for biological science infor- 
mation, and one involves an attitude toward science. 

The number of questions showing atypical effects at age 13 is 
larger than that observed at other ages. When the estimates 
of sampling error for the individual exercises are examined, 
it appears that for five of the seven laboratory exercises, 
the estimated sampling error is large enough to account for the 
atypical effect. The two for which this is not the case are 
released Exercise Nos. 233 and 236. 

Age 17 . At age 17 the median Southeast effect is a deficit of 
4.8%, exactly the same as at age 13. On only 18 (15%) of the 
124 exercises is there an advantage for the Southeast. 

Of five exercises showing an advantage as great as 3%, none of 
which have been released, four seek factual knowledge in the 
physical sciences (Objective 1) and one seeks knowledge of the 
investigative nature of science (Objective 3). Seven exercises 
exhibit a Southeast deficit as great as 13%, three of which 
have been released. One (Exercise No. 309) requires the ability 
to relate Darwin to the theory of natural selection; one 
(Exercise No. 320) asks for solution of a problem involving 
determining the time to travel downstream a distance of 10 
miles, given the speed of the boat and the speed of the cur- 
rent. The third (No. 353) presents the statement, "U.S. 
scientists are ahead of scientists in every other country in 
every field of research." In the total sample 74% responded, 

"I don't believe this." In the Southeast only 59% gave that 
response. Of the unreleased exercises with Southeast deficits 
in excess of 13%, three are biological science content and 
assess Objective 1; the other is a laboratory exercise re- 
quiring the respondent to make a physical measurement. 
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Adults . The median effect for adults in the Southeast is a 
deficit of 5.1%. For 15 of the 119 exercises (13%) the South- 
east does better than the country as a whole and for 104 exer- 
cises (87%) the Southeast does less well. Three exercises 
show a Southeast deficit of 18% or more, none of which have 
been released. They all test science knowledge; one is re- 
lated to human reproduction, one to the solar system, and one 
to the basic nature of living organisms. 

All Ages . As reported in Chapter 3, when the 42 exercises ex- 
hibiting atypical Southeast regional effects are accumulated 
over ages, there do seem to be systematic differences between 
those showing an atypical advantage and those showing an 
atypical deficit. One difference is in the relative frequency 
of exercises testing Objective 2 skills, none of 15 exercises 
showing an advantage in contrast to nine of 27 exercises show- 
ing a deficit. Some, but certainly not all, of this contrast 
may be the result of the larger sampling errors for laboratory 
exercises. The other difference is in the nature of the exer- 
cises assessing Objective 1. Those exhibiting a Southeast 
advantage seem to deal with everyday phenomena — coal from 
a neighboring mountain, transformers, automobiles, food, rain- 
bows, and the weather. Those exhibiting a disadvantage tend 
to deal either with facts learned in textbooks — the size and 
motion of the planets, the scientific vocabulary of human re- 
production, Darwin and natural selection, the significance of 
fossils — or with more than simple recall — the ability to recog- 
nize the example that fits a definition or to calculate a 
quantity given the basic relationships. 

Science Objectives . The median Southeast effects for exercises, 
grouped by science objectives, are reported in Exhibit 6-5. 

The only systematic pattern that emerges is the larger deficit 
for Objective 2 exercises at ages 13 and 17. At age 13 the 
median disadvantage for th;* 75 Objective 1 exercises is 4.5% 
while that for the 31 Objective 2 exercises is 7.7%. At age 
17, the corresponding figures are 4.2% and 7.3%. The dif- 
ferences at ages 9 and adult could easily be attributed to 
sampling fluctuations. 
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Exhibit 6-5 



Median Southeast regional effects for Objectives 1 and 2 and 
for all exercises (Numbers of exercises in parentheses) 



Objective Age 



1 


9 

-5.1 


(96) 


13 

-4.5 


(75) 


17 

-4.2 


(89) 


Adult 

-5.5 


(85) 


2 


-5 . 6 


(29) 


-7.7 


(31) 


-7.3 


(24) 


-4.5 


(24) 


All exercises 


-5.1 


(145) 


-4.8 


(122) 


-4.8 


(124) 


-5.1 


(119) 



Physical Science versus Biological Science . Median Southeast 
regional effects and numbers of exercises classified according 
to content are reported in Exhibit 6-6. At ages 9, 17, and 
adult the medians show less of a deficit for biological science 
than for physical science; at age 13 the direction is reversed. 

None of the contrasts is statistically significant. 

Exhibit 6-6 

Median Southeast regional effects for physical and biological 
science exercises (Numbers of exercises in parentheses) 

Age 

9 13 17 Adult 

Physical science -5.2 (78) -4.8 (67) —5.1 (78) -5.3 (63) 

Biological science -4.7 (48) -5.0 (39) -4.6 (33) -4.9 (44) 

Central 



The distributions of regional effects for the Central region are 
displayed in Exhibit 3-3 and discussed in Chapter 3. The medians 
at ages 9 and 13 are higher than for the national sample, those 
for ages 17 and adult about the same as for the national sample. 

All Ages . Over all four ages, there are 12 exercises showing 
atypical effects, three of which have been released. At age 9, 
Exercise 123 (Central 76% correct, national 67%) asks what 
scientists can learn from studying fossils. Exercise 138 
(Central 29% correct, national 35%) tests knowledge of the 
effect on weather of the passage of a cold front. The three 
unreleased exercises at age 9, one showing a positive effect, 
the other two a negative effect, do not seem to help in estab- 
lishing a pattern. 
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Nor do the exercises at other ages. There are nc atypical 
exercises at age 13 and only two, both unreleased, at age 17. 

For adults. Exercise 405 (Central 81% correct, national 71%) 
asks about the effects of tying off sperm ducts. 

When all atypical exercises, both reported and unreported at 
the four ages, are examined, one is left with the impression 
that most of the atypical effects are the result of sampling 
fluctuations. There appears to be no pattern characterizing 
these exercises. Furthermore, of the 12 exercises, seven were 
administered at more than one age, yet in no case did an exer- 
cise exhibit an atypical effect at more than one age level. 

It may be noted, however, that four of the nine exercises on 
which the Central region performs markedly better than the 
country as a whole are classified under Objective 2 and that 
none of the three exercises exhibiting a deficit fall in this 
classification. To investigate the possibility that these 
findings mark a trend, we examine all the exercises after group- 
ing them by objectives. 

Classes of Exercises . The median effects for exercises classi- 
fied under Objectives 1 and 2 and for all exercises are re- 
ported in Exhibit 6-7. 



Exhibit 6-7 

Median Central regional effects for objectives 1 and 2 and 
for all exercises (Numbers of exercises in parentheses) 



Objective 






Age 








9 




13 


17 


Adult 


1 


1.3 


(96) 


2.1 (75) 


-0.3 (89) 


0.1 (85) 


2 


2.7 


(29) 


2.3 (31) 


0.2 (24) 


2.1 (24) 


All exercises 


1.6 


(145) 


1.9 (122) 


0.0 (124) 


0.6 (119) 



At each age the median effect shows a greater advantage for Objec- 
tive 2 exercises than for Objective 1 exercises. The difference 
for adults is the largest of the four. Of 85 adult exercises clas- 
sified under Objective 1, 44 show an advantage, and 41 a deficit. 

Of 24 classified under Objective 2, 21 show an advantage and only 
three a disadvantage. The median over all exercises for adults in 
the Central region is only slightly above the national: 0.6t. If 
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there is an advantage for adults in the Central region, it must 
be attributed to superior performance on Objective 2 exercises. 

The classification of exercises according to science content 
(Exhibit 6-8) provides no evidence of any significant contrasts. 

Exhibit 6-8 

Median Central regional effects for physical and biological 



science exercises 


(Numbers 


of exercises in 
Aqe 


parentheses ) 








9 




13 


11 


Adult 


Physical science 


1.6 


(78) 


2.3 (67) 


-0.3 (78) 


0.9 


(63) 


Biological science 


1.6 


(48) 


1.8 (39) 


1.1 (33) 


-0.1 


(44) 



West 



The distributions of West regional effects at the four age levels 
are presented in Exhibit 3-3 and discussed in Chapter 3. 

All Ages . Over all ages there are 17 exercises showing atypical 
regional effects for the West, only two of which have been re- 
leased. At age 17, Exercise No. 309 (West 75% correct, national 
66%) requires relating Darwin to the theory of natural selection. 
(It was also administered to adults and at that age the West 
advantage was 5%) 

The other exercise. No. 445 (West 1% correct, national 11%), 
is a laboratory exercise requiring the adult examinees to cal- 
culate the density of a wooden block after finding the mass and 
the volume. It did not exhibit an atypical effect when it was 
administered at ages 13 or 17. 

The 17 atypical exercises are divided about equally (eight versus 
nine) between those above the median for the West and those 
below; the number (four) of exercises classified under Objec- 
tive 2 is too small to show any trend. 

Classes of Exercises . When exercises are grouped by objective, 
the median effects for the West for objectives are those re- 
ported in Exhibit 6-9. None of the differences between medians 
for Objective 1 and Objective 2 exercises is statistically 
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significant. However, when these data are summarized across 
all comparisons of region and age (see Exhibit 3-5) the tendency 
for Objective 2 effects to be in the same direction but more 
extreme than those for Objective 1 is a statistically signif- 
icant tendency at ages 17 and adult in the West. 

Exhibit 6-9 

Median West regional effects for Objectives 1 and 2 and 
for all exercises (Numbers of exercises in parentheses) . 



Objective Age 





9 




13. 




17 


Adult 


1 


0.6 


(96) 


-0.6 


(75) 


1.1 (89) 


2.3 (85) 


2 


-1.3 


(29) 


-0.6 


(31) 


2.6 (24) 


2.9 (24) 


All exercises 


0.0 


(145) 


-0.6 


(122) 


1.7 (124) 


2.6 (119) 



In this context, the median effects at ages 9 and 13 in the West ap- 
pear as exceptions to the general finding. 

The pattern of median effects for physical science and biolog- 
ical science exercises for the West (Exhibit 6-10) follows 
closely the pattern for all exercises reported in Chapter 3. 

Exhibit 6-10 

Median West regional effects for physical and biological 
science exercises (Numbers of exercises in parentheses) 

Age 

9. 13. 17 Adult 

Physical science -0.2 (78) -0.9 (67) 1.5 (78) 1.8 (63) 

Biological science 0.5 (48) -0.1 (39) 2.0 (33) 3.3 (44) 

Exercises Administered At More Than One Age 

Most of the exercises administered at an age level were developed 
especially for that age; however, some of the exercises at 
each age level were also administered at one or more other ages. 
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The numbers of these overlapping exercises varied from 14 
administered to both 9s and adults to 62 given to both 17s 
and adults. 

The comparisons reported in Chapter 3 and thus far in Chapter 
6 were developed using data from all the exercises available 
at an age and for a classification. Would the picture be the 
same if one were to limit the comparisons to exercises common 
to more than one age? 

In those instances where there are few exercises common to 
two ages, the data are insufficient either to confirm or to 
refute the conclusions based on all exercises. Whenever 
there are a sufficient number of common exercises to warrant 
drawing a conclusion, the data in general support the trends 
observed over all exercises. 
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SIZE-OF-COMMUNITY EFFECTS IN DETAIL 



Chapter 4 was mainly concerned with the size of community ef- 
fects which were typical across all ages and exercises. Here 
we turn to the details characterizing atypical exercises and 
predefined classes of exercises. (Each of the released exer- 
cises is available in NAEP Report Number 1.) Each of the four 
size-of-community (SOC) groups is considered in turn. 

Big Cities 



Atypical Exercises . We first, examine specific exercises on 
which Big City performance i. 3 atypically high or atypically 
low. The procedure for identifying such atypical exercises 
is explained in Appendix A. 

At age 9, Big City performance on one exercise exceeds the 
upper cutting point of 3%, determining those exercises which 
are atypical for this group (37% Big City correct versus 33% 
national) . It is an unreleased exercise about words used to 
refer to scientific generalizations. On four other exercises 
at age 9, all unreleased. Big Cit} performance is atypically 
low — at least 13% worse (the lower cutting point for this 
group) than national. Three of these four exercises, which 
cover simple properties of campfires, garden plants, and 
fish, involve content material to which Big City children may 
have had less real-life exposure than small town, country and 
suburban children. The fourth exercise deals with household 
uses of electricity, and it is not clear why there is a large 
Big City deficit on this particular exercise. 

Only one exercise can clearly be considered atypical for Big 
City 13-year-olds, an unreleased exercise concerning the con- 
cept of "fact" in science, on which there was a Big City 
deficit of 14%. One of the incorrect alternatives, which was 
open to an unplanned interpretation, was 10% more attractive 
to the Big City respondents than to all 13s. 

At age 17, Big City respondents do atypically well on four ex- 
ercises, two of which are released. Exercise No. 328 (Big 
City advantage 6%) requires the knowledge that DNA is present 
equally in egg and sperm cells. Exercise No. 322, which shows 
a Big City advantage of 5%, requires the respondent to attri- 
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bute overweight in a light eater to "highly efficient utiliza- 
tion of food by the body." The two unreleased exercises with 
Big City advantages as large as 5% concern the properties of 
temperature, and the proper arrangement of certain chemical 
apparatus. These four exercises are thus seen to involve rather 
miscellaneous content. 

Six exercises at this age show atypical deficits (-10% or more) . 
Exercise No. 323, with a Big City deficit of 11%, tests the 
knowledge that the purpose of a fuse is to prevent possible 
damage to the circuit. Exercise No. 320 (Big City deficit of 
11%) is a boat and river arithmetic problem requiring simple 
addition of velocities to deduce the time necessary to reach 
a downstream destination. In Exercise No. 305 (deficit 11%), 
the respondent must select "the movement and characteristics 
of air masses" as the most important of several factors in pre- 
dicting weather. One might conjecture that the crucial common 
element of these exercises, as for age 9, is material with 
which Big City 17-year-olds have less practical personal ex- 
perience: fuses, flowing streams, and weather prediction. 

The three unreleased exercises with Big City deficits exceed- 
ing 10% involve bits of knowledge concerning cement, trans- 
formers, and building materials — again consistent with the 
"practical experience" explanation. 

There are fewer clues among the atypical exercises at the adult 
level. One unreleased multi-part apparatus exercise has the 
curious property that Big City adults perform 16% worse than 
the national average on the first part and 11% better on the 
last part. In view of the large sampling error for apparatus 
exercises, perhaps this instability is due to chance. On 
released Exercise No. 407 involving the perhaps unfamiliar 
word "testes" in connection with the production of sperm cells, 
15% fewer Big City adults than all adults give the correct 
answer. An unreleased exercise on the world's food supply 
displays a 16% Big City deficit. 

Science Objectives . The lack of out-of-school experience with 
certain kinds of content (e.g., "outdoor" things, such as 
gardens, streams, campfires and cement) may contribute substan- 
tially to a typically large Big City deficits. This speculation 
does not explain why there is a consistent small Big City deficit 
on the majority of exercises at all ages. 

Another way to categorize exercises is by the four science ob- 
jectives. Exhibit 7-1 presents the median Big City effects 
at each age level for exercises falling under each of the 
objectives. 
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Exhibit 7-1 

Median Big City effects for exercises under each objective 
(Numbers of exercises in parentheses) 



Objective 



Age 



13 



17 



Adult 



1 


-5.0 


(96) 


-4.6 


(75) 


-2.4 


(89) 


-2.3 


(85) 


2 


-5.0 


(29) 


-4.8 


(31) 


-3.8 


(24) 


-4.2 


(24) 


3 


-4.0 


(ID 


-5.0 


( 8) 


-1.9 


( 6) 


1.2 


( 5) 


4 


-2.3 


( 9) 


-1.9 


( 8) 


-1.7 


( 5) 


0.4 


( 5) 


exercises 


-4.8 


(145) 


-4.7 


(122) 


-2.6 


(124) 


-2.4 


(119) 



The comparison based on the most exercises is between Objectives 1 
and 2. The data of Exhibit 7-1 suggest a very slightly worse 
Big City performance on Objective 2 exercises (Possess the abil- 
ities and skills to engage in the processes of science) than 
Objective 1 exercises (Know the fundamental facts of science) 
at ages 17 and adult. The differences between Objective 1 and 
Objective 2 medians at these age levels are 1.4% and 1.9%. Each 
of these differences might plausibly have arisen from sampling 
error. Nevertheless, a consistent pattern of difference between 
the two objectives for other SOC groups suggests that the apparent 
slight difference between performances on the two objectives 
for Big City respondents may be real. 

The Big City median on Objective 4 (Show appreciation of scien- 
tists and science) , is above the medians for Objectives 1 and 
2 at all ages. This relatively better performance is based on 
very small numbers of exercises, but the consistency over ages 
gives a fair degree of assurance of its reality. It is perhaps 
not surprising that Big City children and adults hav^ attitudes 
almost as favorable toward science as respondents in general, 
despite the fact that there is a median deficit of 2% to 5% in 
science knowledge and skill for Big City respondents. 

Objective 3 (Understand the investigative nature of science) 
falls somewhere in between the others. At ages 9 and 13 there 
is a deficit comparable to that for Objectives 1 and 2, but at 
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ages 17 and adult, performance on Objective 3 seems somewhat 
better than on Objectives 1 and 2, and in fact is close to 
national performance. The number of exercises is too small, 
however, to draw firm conclusions about something as subtle as 
an age trend for this objective. The trend over ages for 
Objective 1, however, is based upon enough exercises to be 
regarded as reliable. We note from Exhibit 7-1 that, for Ob- 
jective 1 exercises, there is approximately a. 2.5% improvement 
in 17 and adult Big City performance over 9- and 13-year-old 
performance. This trend quite closely mirrors the Big City 
age trend for all exercises. 

Physical Science versus Biological Science . Exhibit 7-2 presents 
the median Big City effects at each age level for exercises 
classified as physical or biological science. A few unclassi- 
fied exercises are omitted from this summary. There is no con- 
sistent difference in Big City performance between the two 
types of science exercises. The smaller deficits at the upper 
age levels reflect the general age trend. 



Exhibit 7-2 

Median Big City effects for physical and 
biological science exercises 

(Numbers of exercises in parentheses) 



Age 





9 


13_ 


17 


Adult 


Physical science 


-4.5 (78) 


-4.6 (67) 


-3.3 (78) 


-2.8 (63) 


Biological science 


-5.3 (48) 


-4.7 (39) 


-1.9 (33) 


-3.0 (44) 



Urban Fringe 

Atypical Exercises . At age 9, three exercises show atypically 
large Urban Fringe advantages. One of these, released Exercise 
No. 123, asks what scientists learn by studying fossils. Fifteen 
percent more 9-year -olds in the Urban Fringe, compared to all 
95, know the answer concerns animals who lived long ago. An 
unreleased exercise with an Urban Fringe advantage of 11% involves 
a simple geological fact. Another unreleased exercise with a 
9% advantage deals with household uses of electricity. Two 
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atypical unreleased exercises with deficits of 3% and 5% respec- 
tively involve simple properties of nutrition and weather. 

At age 13, two unreleased exercises on which Urban Fringe re- 
spondents do much better than the national average (by 10% and 
12%) involve quantitative matters from genetics and from a 
mechanical apparatus set-up, and a third (11% advantage) asks 
for general knowledge about atoms. Among the five atypical 
exercises with Urban Fringe disadvantages of from 3% to 5% are 
parts of three released apparatus exercises. No. 234, No. 237, ! 

and No. 240. Respondents are asked to select a weight to 
balance a balance pan, to use the measured density of a wood 
block to specify the fraction of it which would be above water 
during floating, and to time 10 swings of a pendulum. (We 
have already noted special problems of interpretation for the 
last of these, at least when given to adults.) Two unreleased 
exercises with similar Urban Fringe deficits concern blood 
cells and measurement concepts. 

At age 17, two released exercises with Urban Fringe advantages 
of 9% are No. 342 and No. 353. The former is an ecological 
question about the effects of a decrease in a rabbit population 
on grass and on hawks. The latter seeks a correct judg- 
ment that U.S. scientists are not ahead of the scientists of 
all other countries in all fields. Two unreleased exercises 
(Urban Fringe advantages of 12% and 11%) concern straightfor- 
ward chemical and genetic matters, and one (10% advantage) is 
a complex question about conditions on the moon. The one 
exercise on which Urban Fringe respondents do a typically worse 
(5%) than national is No. 344, the same as No. 240 noted above 
at age 13. This requires timing 10 swings of a pendulum, and 
as given to adults has been noted (in Chapter 5) as presenting 
special problems of interpretation. 

In contrast to the fact that Urban Fringe 13- and 17-year-olds 
do a typically poorly on timing a pendulum (Exercise No. 

441 for adults) , Urban Fringe adults perform 17% better than 
the national average on this same exercise l Again, misinter- 
pretation of wording is a likely contributing cause. On two 
other released adult exercises. No. 409 and No. 447, the Urban 
Fringe advantage is 14% and 16%, respectively. In No. 409, the 
name of Darwin must be associated with the theory of evolution. 

In No. 447, the respondent must choose the answer, "A possible 
explanation for observations/' as the best of several given 
meanings of the term "theory." A final example of atypica.lly 
high Urban Fringe advantage (14%) is the unreleased quantitative 
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mechanics exercise on which 13-year-old Urban Fringe respondents 
also had an advantage (12%) . No exercises at the adult level 
fell below the cutoff used to define atypically poor performance. 

There seems nothing strongly in common among the various examples 
of exercises with especially good Urban Fringe performance at 
all ages, except possibly a tendency toward slightly esoteric 
science content (fossils, atoms, definition of a "theory," etc.). 

Science Objectives . Exhibit 7-3 shows median Urban Fringe ef- 
fects by science objective at each age level. 



Exhibit 7-3 

Median Urban Fringe effects for exercises 
under each objective 

(Numbers of exercises in parentheses) 



Age 

Objective 9. 13_ 1_7 Adult 



1 


3.1 


(96) 


3.3 


(75) 


1.9 


(89) 


3.0 


(85) 


2 


3.6 


(29) 


4.0 


(31) 


3.2 


(24) 


4.4 


(24) 


3 


2.5 


(11) 


4.2 


( 8) 


4.2 


( 6) 


7.1 


( 5) 


4 


2.0 


( 9) 


1.0 


( 8) 


3.3 


( 5) 


2.4 


( 5) 


All exercises 


3.0 


(145) 


3.4 


(122) 


2.4 


(124) 


3.2 


(119) 



Comparing Urban Fringe effects on Objectives 1 and 2, we find 
consistently better performance on Objective 2 exercises: by 

.6%, .7%, 1.3%, and 1.4% at the four age levels. The net 
magnitude of these consistent differences is sufficient to 
have moderate confidence that they are not accidents of sampling 
error. Objective 2 exercises slightly sharpen the general 
advantage of Urban Fringe respondents. These same exercises 
slightly sharpen the general deficit of Big City respondents, 
as noted in the last section. 

The small number of exercises for Objectives 3 and 4, and the 
irregularity of the pattern of results from them in Exhibit 7-3, 
preclude making any confident generalizations about their 



o 

ERIC 



69 



60 



effects. However, in most cases the median advantages are 
smaller (nearer zero) for Objective 4. Recalling that Big City 
median deficits were nearer zero for Objective 4, we might con- 
jecture that the exercises on this objective tend to show smaller 
distinctions between population groups. 

Physical Science versus Biological Science . Exhibit 7—4 gives 
median Urban Fringe effects for physical and biological science 
exercises. No consistent difference between these two types 
of exercises is revealed. 



Exhibit 7-4 

Median Urban Fringe effects for physical and 
biological science exercises 

(Numbers of exercises in parentheses) 



Age 

9 13_ 17 Adult 

Physical science 3.6 (78) 3.3 (67) 2.3 (78) 3.0 (63) 

Biological science 2.4 (48) 2.9 (39) 1.9 (33) 3.5 (44) 



Medium-Size Cities 

Atypical Exercises . In general, the Medium Cities distributions 
in Exhibit 4-3 (page 25) are more compact than was true for pre- 
vious SOC categories (compare Exhibits 4—1 and 4-2 on pages 23 
and 24). Thus there are fewer atypical or highly extreme exer- 
cises . 

At age 9, Medium City respondents do atypically well (7% advan- 
tage) on one unreleased exercise concerning the relative den- 
sities of common substances. 

No atypical exercises appear at age 13 . At age 17, Medium Cities 
respondents performed unusually well (7-10% above national) on 
one released and five unreleased exercises. Exercise No. 318 
requires the respondent to know that in most chemical changes, 
atoms are rearranged into new molecules . The unreleased exer- 
cises tap knowledge about pendulums, simple diseases, electric 
charge, simple analytic geometry, and how to plot and interpret 
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data from an experiment in mechanics. The one exercise with 
an unusual Medium City deficit (6%) at age 17 is No. 322. It 
requires the respondent to attribute overweight in a light 
eater to "highly efficient utilization of food by the body." 

At the adult level, two unreleased exercises yield atypically 
low Medium Cities performance. One concerns a fact of astron- 
omy, and the other requires a generalization from measurements 
the respondent has just made with a simple apparatus. 

Science Objectives . Exhibit 7—5 gives median Medium City 
effects by objective. With only one possible exception, the 
differences from objective to objective may reasonably reflect 
only inevitable sampling fluctuations. At the adult level, 
Objective 2 performance is 2.0% better than Objective 1 perfor- 
mance, a difference of borderline statistical reliability. 



Exhibit 7-5 

Median Medium City effects for exercises 
under each objective 

(Numbers of exercises in parentheses) 



Age 

Objective 9 _ L3 17 _ Adult 



1 


0.9 


(96) 


1.2 


(75) 


1.0 


(89) 


0.3 


(85) 


2 


0.1 


(29) 


1.7 


(31) 


0.9 


(24) 


2.3 


(24) 


3 


1.4 


(ID 


0.3 


( 8) 


0.1 


( 6) 


-0.2 


( 5) 


4 


0.9 


( 9) 


0.5 


( 8) 


0.2 


( 5) 


-0.2 


( 5) 


All exercises 


0.8 


(145) 


1.1 


(122) 


0.8 


(124) 


0.4 


(119) 



Physical Science versus Biological Science . Exhibit 7-6 gives 
medians for physical and biological science exercises. It is 
difficult to interpret the small variations displayed. 
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Exhibit 7-6 

Median Medium City effects for physical and 
biological science exercises 

(Numbers of exercises in parentheses) 



9 

Physical science 0.6 (78) 
Biological science 1.2 (48) 



Age 

13 17 

0.8 (67) 0.6 (78) 

2.1 (39) 1.5 (33) 



Adult 
2.0 (63) 
-0.2 (44) 



Smaller Places 



Atypical Exercises . As was the case for Medium Cities, there 
are few exercises atypical for Smaller Place respondents. 

At age 9, Smaller Place performance is uncommonly good ( 6 % above 
national) on one unreleased exercise about rocks. 

There are no atypical exercises at age 13. At age 17, there 
are two examples of notably high Smaller Place effects (4% 
above national), released Exercises No. 322 and No. 344. The 
former is the same body chemistry exercise on which Big City 
17s do especially well and Medium City 17s especially poorly. 
This profile of atypicality across SOC groups may reflect the 
unexpected correct answer of this exercise. Exercise No. 344 
involves timing a pendulum, the same apparatus exercise that 
has previously been noted as having special problems of inter- 
pretation. Two unreleased exercises on which Smaller Place 
respondents perform below the national average by 9 % and 11%, 
respectively, elicit simple information about analytic geometry 
and about the solar system. 

There are no atypically good exercises for Smaller Place adults. 
The four examples of especially poor performance (12-15% below 
national) include released Exercise No. 409, which requires 
associating the name of Darwin with the theory of evolution, 
an exercise on which Urban Fringe respondents did especially 
well. Local variations in the emphasis given to the theory of 
evolution would not be surprising. (The Southeast performance 
on this exercise is 12% below the national average.) Exercise 
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No. 406 requires identification of adrenaline as a heart stim- 
ulant. One unreleased exercise involves recognition of a 
technical term from biology, and another necessitates realiz- 
ing that uncertainty surrounds a certain statement about the 
solar system. This handful of exercises on which Smaller 
Place respondents do very poorly seems to emphasize isolated 
bits of "bookish” fact. 

Science Objectives . Exhibit 7-7 gives median Smaller Place 
effects by objective. A comparison of Objectives 1 and 2 reveals 
but slight differences, although the direction of the general 
tendency is for Objective 2 performance to seem worse than 
Objective 1 performance (except at age ll f when they are equal) . 

Exhibit 7-7 

Median Smaller Place effects for exercises 
under each objective 

(Numbers of exercises in parentheses) 



Age 



Objective 


9 




13 




17 




Adult 


1 


-0.9 


(96) 


-0.9 


(75) 


-1.9 


(89) 


-2.6 


(85) 


2 


-1.3 


(29) 


-2.2 


(31) 


-1.9 


(24) 


-4.2 


(24) 


3 


-0.9 


(ID 


1.4 


( 8) 


-3.9 


( 6) 


-7.2 


( 5) 


4 


-1.3 


( 9) 


-1.1 


( 8) 


-2.1 


( 5) 


-2.7 


( 5) 


All exercises 


-1.2 


(145) 


-1.1 


(122) 


-2.1 


(124) 


-2.8 


(119) 



Such a tendency is consistent with the pattern whereby Objective 
2 exercises sharpen the general effect for each particular SOC 
group (see Exhibit 4-5) . For Smaller Place respondents. Objec- 
tive 4 shows no differential effects, with each Objective 4 
median lying very close to the overall median for the particular 
age group. The Objective 3 effects are ragged from age to age. 
Especially prominent is the median Smaller Place deficit of 7.2% 
on the five Objective 3 exercises at the adult level. No obvi- 
ous explanation for such an effect suggests itself. 
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Physical Science versus Biological Science . The median Smaller 
Place effects for the two types of science content appear in 
Exhibit 7-8. There are no noteworthy differences between phys- 
ical and biological science except at age 17, where there is a 
3% greater deficit for biological than physical science exer- 
cises. This difference between physical and biological science 
performance is larger than that for any other SOC and age combi- 
nations (see Exhibits 7-2, 7-4, 7-6, 7-8), and is clearly 
reliable statistically. Is biology relatively less emphasized 
than physics and chemistry in rural and small town high schools? 
If so, this might be responsible for the 3 % difference. 

Exhibit 7-8 

Median Smaller Place effects for physical and 
biological science exercises 

(Numbers of exercises in parentheses) 



Age 





9 


13 


17 


Adult 


Physical science 


-1.3 (78) 


-0.9 (67) 


-0.9 (78) 


-3.2 (63) 


Biological science 


-1.1 (48) 


-1.6 (39) 


-3.9 (33) 


-2.3 (44) 



o 

ERIC 



74 



65 



J 



APPENDIX A 
CHOOSING THE CUTOFFS 



Our observed percentages of success are based on samples, but 
our interest is in whole populations. Thus our information 
is always less than complete. As a result, conclusions stated 
either in numbers or in words are approximate rather than exact, 
slightly blurred rather than sharp. Whatever rule we choose 
to fix the cutoffs, and thus to decide which comparisons are 
atypical and worth serious consideration, the performance of 
that rule will be less than perfect. 

However we choose a cutoff rule, there will be some chance that 
an exercise whose population value is typical or close to typi- 
cal will have a sample value classed as atypical by our rule. 

The more one tries to avoid this by stiffening the rule, thus 
calling fewer exercises atypical, the more often sample values 
will fail to be recognized as atypical when their population 
values are far from typical. This is a general problem that 
arises whenever sample results (including differences and ef- 
fects) are examined in detail. 

In choosing a rule, we must balance these two kinds of unfor- 
tunate consequences. We may also need to consider what sizes 
of differences or effects are likely to be practically mean- 
ingful. Besides discussing individual exercises, we want to 
discuss the groups of exercises found atypical in a given 
direction. Our choice of cutoffs affects our ability to describe 
such groups. The rule whose details are given below was 
chosen with some attention to all these considerations. 

Variances and Standard Errors 



The samples drawn for the first assessment, both in and out 
of school, involved careful designs in which the country was 
divided into areas called strata. For each exercise, results 
were obtained for two groups of respondents in each stratum.* 



*The two groups come from two "primary sampling units" in each 
stratum. (See Appendix C of NAEP Report No. 1.) Because of 
the existence of planned half strata and the loss of some 
primary sampling units, losses that were duly allowed for in 
the estimation procedure, it was necessary to "collapse" a few 
pairs of strata by combining two original strata in one stratum 
for the purpose of assessing sampling variability. 
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To assess sampling variability, it is necessary to estimate a 
suitable measure of how far the results of the 1969-70 admin- 
istration of a science exercise would have varied had different 
groups of respondents been selected in any or all strata. Our 
concern has to be with other possible selections according to 
the same random procedures used in the actual administration. 

In this situation, the use of two groups of respondents per 
stratum allows us to calculate an estimate of such a measure 
of variability, the sampling variance, defined as the average 
square of the difference in result between a single adminis- 
tration and the average result for all possible administrations. 

In this report, our immediate concern is with the estimated 
sampling variance of sex differences, regional effects, and 
size -of -community effects. It is equally useful to look at 
the standard errors of these comparisons, defined as the square 
roots of the estimated sampling variances. 

These estimated sampling variances or standard errors will vary 
from one exercise to another for several reasons, including: 

1) Different sample sizes for some exercises. (The time- 
consuming apparatus exercises were given to fewer respondents. 
Certain other exercises were excluded in a few jurisdictions. 
Shortness of class periods occasionally caused omission of 
later exercises.) 

2) The automatic decrease in variance accompanying very easy 
or very hard exercises. 

3) Larger or smaller differences in the sizes of actual 
school-to-school differences. (In some cases these presumably 
reflect the greater or lesser effect or extent of curriculum 
variations. ) 

4) Sampling variations inevitable in the process of estimating 
sampling variability. 

For reference. Appendix D presents estimated standard errors 
for the sex effects, regional effects and size-of- community 
effects for each science exercise. 

Kinds of Cutoffs 




If we wanted to look at results only for individual exercises, 
it would be natural to compare each deviation (whether from 
zero or from a typical comparison) with a suitable multiple of 



A-2 






7G 



¥ 



the standard error available for that deviation. Doing this 
would correspond, in terms of cutoffs, to using a separate 
cutoff for each exercise. This would have been quite possi- 
ble by using the estimated standard errors given in Appendix 
D. However, it was felt that this would involve more details 
and complications than would be warranted by the likely gains. 
(The interested reader can use Appendices C and D as a basis 
for his own experiments with this type of analysis.) 

Instead, the choice was to use a single pair of cutoffs for 
each class of exercises. To do this, it was necessary to 
combine standard errors or estimated sampling variances across 
exercises. For this purpose, it is appropriate to use a com- 
bined standard error, which is the square root of a combined 
estimated sampling variance, this latter being the arithmetic 
mean of the estimated sampling variances for the exercises 
involved. The combined standard errors actually used are 
given in Exhibit A-l. 

Size of Cutoffs 



The easiest way to use the combined standard error to fix the 
cutoffs is to start from the corresponding median and lay off 
a suitable multiple of the combined standard error on either 
side. If the multiple is large, very few exercises will be 
falsely called atypical, but we will fail to take advantage of 
the information provided by intermediate deviations. 

There are circumstances under which we might be able to make 
good use of quite sir-all multipliers, even fractions. When 
comparing one subclass of exercises well below the median 
with another subclass of exercises well above the median, 
larger subclasses could be helpful, so helpful, as to outweigh 
the fact that a substantial fraction of each subclass would 
fall in that subclass purely by accident. 

A multiplier of about 2 is conventional for many purposes. 

This choice leads to about 5% of all items entering one dis- 
tinctive subclass or the other purely by chance. Since we 
deal with classes of about 120 exercises (and since 5% of 120 
is six), this would mean an average of about 3 "atypically 
high" exercises and about 3 "atypically low" exercises purely 
by chance. Thus this choice seemed more likely to confuse the 
picture than to clarify it. 

Another practical detail should concern us. Our exhibits show 
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Exhibit A-l 

Combined standard errors used to establish cutoffs 
for sex, region, and size of community 



Age 



2 


13 


17 


Adult 



Male-female 


difference 


3.0 


3.0 


3.0 


4.5 


Region 


Northeast 


2.26 


2.53 


2.67 


3.87 


Southeast 


2.88 


2.85 


3.12 


4.79 


Central 


2.46 


2,49 


2.32 


3.42 


West 


2.49 


2.49 


2.43 


4.37 


Size of Community 


Big City 


2.91 


2.95 


2.64 


4.78 


Urban Fringe 


2.32 


2.47 


2.47 


3.88 


Medium City 


2.36 


2.53 


2.49 


3.83 


Smaller Places 


2.38 


2.55 


2.41 


3.60 
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each direction until 
range frcr the next 



* * r* — — - 
— -* »v* • 



divides 



Thus if the r'-edian v/ere -1.7 % and 
v/e re 1.9%, v/e v/ou Id find first 



the c or b in ed standard error 



(2.5} (1.9%) = 4.75% 



then 



-1.7% - 4.75% = -6.45% and 
and then place the arrows between 

-7.9 to -7.0 and 
on the low side and between 

3.0 to 3.9 and 



-1.7% + 4.75% 



—6.9 to —6.0 



4.0 to 4.9 



3.05% 



on the high side. 

For each class of rather more than 100 exercises, an average 
of perhaps one or two exercises can be expected to fall outside 
those cutoffs because of sampling fluctuations. (The smaller 
sample sizes for apparatus exercises can increase these numbers 
somewhat.) A substantial number of exercises that we might 
have liked to consider atypical will of course fail to be rec- 
ognized as such. Some such compromise is, however, necessary. 
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APPENDIX B 
Definition of Terms 



The major reporting categories used in this report are: 

Age, Region, and Size of Community (SOC) . 

1. Age: Four age levels were assessed. Three of these 

were in-school — 9, 13, and 17-year-olds; ^nd two — 17s 
and young adults — were out-of-school, sometimes referred 
to as the household sample. The criteria used or the 
operational definitions of the four ages are as follows: 

9s — Born between 1/1/60 and 12/31/60 
13s — Born between 1/1/56 and 12/31/56 
17s — In-school: Born between 10/1/51 and 9/30/52 

Out-of -school: Born between 10/1/50 and 9/30/51 

and not enrolled in school in March '68 OR born 
between 10/1/51 and 9/30/52 and not enrolled in 
school in March 1969. 

young adults — Born between 7/1/33 and 6/30/43 

2. Region; Four geographical regions are being used for all 
age levels — Northeast , Southeast , Central, and West . The 
states falling in each of these four regions, for report- 
ing purposes, are as follows: 



Northeast 


Southeast 


Central 


West 


Del. 


Ala. 


N.D. 


Hawaii 


Maine 


Ark. 


S.D. 


Alaska 


N.H. 


Fla. 


Iowa 


Idaho 


Vt. 


Ga. 


Kansas 


Mont . 


D.C. 


Ky. 


Minn. 


Wyo. 


Md. 


La. 


Mo. 


Utah 


N.J. 


Miss . 


Neb . 


N.M. 


N.Y. 


N.C. 


111. 


Nev. 


Conn. 


S.C. 


Ind. 


Cal. 


Pa. 


Term. 


Mich. 


Ore. 


Mass . 


Va. 


Ohio 


Wash. 


R.I. 


W. Va. 


Wis . 


Ariz . 
Colo. 
Tex. 
Okla. 
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This classification is that used by the Office of Business 
Economics, Department of Commerce; the names for regions 
used by OBE differ from National Assessment names for three 
regions: 



National Assessment 

Northeast 

(Southeast) 

Central 

West 



OBE 

Northern Atlantic 
(Southeast) 

Great Lakes and Plains 
West and Northwest 



3. Size of Community (SOC) : The four reporting categories 

for SOC were (1) Big Cities, (2) Urban Fringe, (3) Medium- 
Size Cities, and (4) Smaller Places. They were defined 
as follows: 



SOC Category 
Big Cities 

Urban Fringe 



Medium-Size 

Cities 



Smaller Places 



Description 

All central cities of the U.S. with a 
population of 200,000 or greater. 

For each county containing a city in 
the Big City SOC category, the region of 
the county not within the city limits 
plus all counties within the same Stan- 
dard Metropolitan Statistical Area (SMSA) 
as the "Big City. " 

All SMSA counties not included in Big 
Cities or Urban Fringe SOC categories plus 
all counties which contain at least one 
city of 25,000 people or more. If such 
a city was a part of more than one county, the 
county with a majority of the city popu- 
lation was classified here. 

All counties and combinations of counties 
with a population under 25,000 not included 
in the above categories. 
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APPENDIX C 

NATIONAL PERCENTAGES OF SUCCESS, AND EFFECTS FOR 
REGION, SIZE OF COMMUNITY, AND SEX FOR 
RELEASED AND UNRELEASED SCIENCE EXERCISES 



Data on the percentage of success for released and unreleased 
exercises are presented in this appendix. Exercises are or- 
dered by age, and for a given age all exercises assigned to 
Objective 1 are listed first, followed, in turn, by the exer- 
cises for Objectives 2,3, and 4. For an objective, released 
exercises are shown in order from highest to lowest percen- 
tage of national success? unreleased exercises, which then 
follow, are also ordered by national percentage of success. 
Exercises are identified by three digits preceded by an R 
(released) or a U (unreleased). The system is: 



Age 9 
Age 13 
Age 17 
Adult 



Released 
RIOO's 
R200 ' s 
R300 ' s 
R400 ' s 



Unreleased 
U600's 
U700 ' s 
U800's 
U900's 



The national percentage of success for an exercise is shown 
in column 2, headed NATL %. The deviation from the national 
percentage of success — called the effect — for each of four 
regions, each of four sizes of community, and males and females 
follows. 



Chapters 2 and 5 deal with the differences between male and 
female performance, rather than with the difference between 
males and national, or females and national. To obtain the 
male-female differences, it is only necessary to subtract the 
sex effect for females from the sex effect for males. 

For the four region and four size- of -community groups, atypical 
exercises — those whose effects lie above or below the cutoffs 
described in Appendix A — are indicated by triangles pointing up 
to indicate atypically high performance, by triangles pointing 
down to indicate atypically low performance. This notation is 
modified to indicate atypical sex differences. A triangle 
pointing up, located to the right of the male effect, indicates 
an atypical male advantage. An up-pointing triangle located 
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to the right of the female effect indicates an atypical female 
advantage. 

The next two columns of the appendix indicate the objective 
for which each exercise was written, and whether the exercise 
was classified as assessing biological (B) or physical (P) 
science. Exercises not classified either as physical or 
biological are marked U. 

The final column identifies exercises also given at other 
ages — overlap exercises. 

Small apparent discrepancies between effects given in this 
appendix and effects discussed or displayed in Chapters 2 
to 7 may be due to the fact that an effect of 11.96, for 
example, is reported here as 11.9, but. appears as 12.0 in 
the text. 



SIZE-OP-COMNUNITY EPPECT SEI 

NATL REGIONAL EPPECT BIG URBAN NED SMALL EPPECT SCI. OVERLAP 

* NE SE C II CITY FRINGE CITY PLACE H P OBJ TYPE EX. NO. 
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APPENDIX D 

STANDARD ERRORS FOR REGION, SIZE-OF-COMMUNITY, 

AND SEX EFFECTS 



This appendix follows the same format as Appendix C, but pre- 
sents the standard errors for national percentage of success, 
for each of four regional effects, each of four size-of-com- 
munity effects, and for the male and female effects. Exer- 
cises are identified by a letter and a three-digit number in 
the first column; they are ordered by age and, within age, 
by the four science objectives. 



As mentioned in Appendix C, the data shown here for sex per- 
formance are in a different form than that presented in the 
text. Chapters 2 and 5 discuss differences between percentage 
of male success and percentage of female success, rather than 
the sex effects which compare each sex's performance with the 
national performance. The appropriate standard error to use 
for a sex difference is obtained from the standard errors for 
the sex differences in the following way: 
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